Abstract

We discuss some ways in which topos theory (a branch of category theory) can 
be applied to interpretative problems in quantum theory and quantum gravity. In
Section 1, we introduce these problems. In Section 2, we introduce topos theory, 
especially the idea of a topos of presheaves. In Section 3, we discuss several possible 
applications of topos theory to the problems in Section 1. In Section 4, we draw 
some conclusions. 
1 Introduction

In this paper, we wish to suggest some possible ways in which the notion of a 'topos' can 
be applied to physics; specifically, to interpretative problems and foundational issues in
quantum theory and quantum gravity. The first of these fields is one to which Marisa 
Dalla Chiara has contributed so much, especially in its logical aspects; so it is a pleasure 
to dedicate to her a paper focused on logical issues. But the second field, quantum 
gravity, also needs to take cognizance of interpretative problems about quantum theory; 
for as we shall describe, research in quantum gravity soon confronts these problems. A 
central theme in this respect is the fundamental dichotomy in quantum theory between 
the traditional instrumentalist interpretation of the theory, and the essentially realist 
view of space and time promulgated by general relativity. Furthermore, we think there 
are some significant ways in which topos theory might be applied in quantum gravity 
proper, not all of which are related directly to the interpretative problems of quantum 
theory. 

In this Section, we shall introduce these two fields. In the next Section, we introduce 
topos theory, especially the idea of a topos of presheaves. In Section 3, we briefly
discuss several possible applications of topos theory to the problems in Section 1; we 
have developed in detail elsewhere [|l], 0, Q one of these applications — namely, to the 
issue of assigning values to quantum-theoretic quantities. Finally, in Section 4, we will 
draw some conclusions. 



1.1 The Problem of Realism in Quantum Theory 

Quantum theory has several interpretative problems, about such topics as measurement 
and non-locality; each of which can be formulated in several ways. But workers in the 
field would probably agree that all the problems centre around the relation between — 
on the one hand — the values of physical quantities, and — on the other — the results of 
measurement. For our purposes, it will be helpful to put this in terms of statements: 
so the issue is the relation between "The quantity A has a value, and that value is r",
(where r is a real number) and "If a measurement of A is made, the result will be r".

In classical physics, this relation is seen as unproblematic. One assumes that, at each 
moment of time: 

(i) every physical quantity has a real number as a value (relative to an appropriate 
choice of units); and 

(ii) one can measure any quantity A 'ideally', i.e. in such a way that the result 
obtained is the value that A possessed before the measurement was made; thus
"epistemology models ontology".

Assumption (i) is implemented mathematically by the representation of quantities as
real-valued functions on a state space $\Gamma$; so that, in particular, the statement "the value
of $A$ is $r$" ($r \in \mathbb{R}$) corresponds to $\bar{A}^{-1}\{r\}$, the subset of $\Gamma$ that is the inverse image of
the singleton set $\{r\} \subset \mathbb{R}$ under the function $\bar{A} : \Gamma \to \mathbb{R}$ that represents the physical
quantity $A$. Thus, in particular, to any state $s \in \Gamma$ there is associated a 'valuation' (an
assignment of values) on all quantities, defined by:

$$V^s(A) := \bar{A}(s). \tag{1.1}$$

More generally, the proposition "the value of $A$ is in $\Delta$" (where $\Delta \subset \mathbb{R}$) corresponds
to the subset $\bar{A}^{-1}(\Delta)$ of $\Gamma$; these subsets form a Boolean lattice, which thus provides
a natural representation of the 'logic' of propositions about the system. In particular,
corresponding to the real-numbered valuation $V^s$ on quantities, defined by a state $s \in \Gamma$,
we have a $\{0, 1\}$-valued valuation (a truth-value assignment) to propositions:

$$V^s(A \in \Delta) := 1 \text{ if } \bar{A}(s) \in \Delta; \text{ otherwise } V^s(A \in \Delta) = 0. \tag{1.2}$$

Thus, in particular, in classical physics each proposition about the system at some fixed
time is regarded as being either true or false.

Note that assumption (ii) is incorporated implicitly in the formalism — namely, in the
absence of any explicit representation of measurement — by the fact that the function
$\bar{A} : \Gamma \to \mathbb{R}$ suffices to represent the quantity $A$, since its values (in the sense of 'values
of a function') are the possessed values (in the sense of 'values of a physical quantity'),
and these would be revealed by an (ideal) measurement. 
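
As a concrete illustration of the classical picture just described, the following Python sketch models a toy system in which a quantity is a real-valued function on states, its value in a state is simply the function value, and a proposition about that value is either true or false. The two-component state, the quantity `energy` and all function names are assumptions made only for this example; they are not notation from the text.

```python
# Toy classical system: a state is a (position, momentum) pair.
# All names below are illustrative assumptions, not notation from the paper.

def energy(state):
    """A physical quantity, represented as a real-valued function on states."""
    q, p = state
    return 0.5 * p * p + 0.5 * q * q

def value(quantity, state):
    """Valuation on quantities: the value in a state is the function value."""
    return quantity(state)

def truth_value(quantity, delta, state):
    """Valuation on propositions: 1 if the value lies in delta, otherwise 0.

    `delta` is a predicate on real numbers, standing in for a subset of R.
    """
    return 1 if delta(quantity(state)) else 0

s = (1.0, 2.0)
print(value(energy, s))                                   # 2.5
print(truth_value(energy, lambda r: 0.0 <= r <= 1.0, s))  # 0
print(truth_value(energy, lambda r: r > 2.0, s))          # 1
```

Every proposition receives exactly one of the two truth values, which is the Boolean behaviour that the quantum case puts in doubt.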

In quantum theory, on the other hand, the relation between values and results, and 
in particular assumptions (i) and (ii), are notoriously problematic. The state-space is a 
Hilbert space $\mathcal{H}$; a quantity $A$ is represented by a self-adjoint operator $\hat{A}$ (which, with no
significant loss of generality, we can assume throughout to be bounded), and a statement
about values "$A \in \Delta$" corresponds naturally to a linear subspace of $\mathcal{H}$ (or, equivalently,
to a spectral projector, $\hat{E}[A \in \Delta]$, of $\hat{A}$).

Assumption (i) above (the existence of possessed values for all quantities) now fails 
by virtue of the famous Kochen-Specker theorem 0; which says, roughly speaking, that 
provided $\dim(\mathcal{H}) > 2$, one cannot assign real numbers as values to all quantum-theory
operators in such a way that for any operator $\hat{A}$ and any function of it $f(\hat{A})$ ($f$ a function
from $\mathbb{R}$ to $\mathbb{R}$), the value of $f(\hat{A})$ is the corresponding function of the value of $\hat{A}$. (On
the other hand, in classical physics, this constraint, called FUNC, is trivially satisfied
by the valuations $V^s$.) In particular, it is no longer possible to assign an unequivocal
true-false value to each proposition of the form "$A \in \Delta$".

In a strict instrumentalist approach to quantum theory, the non-existence of such 
valuations is of no great import, since this interpretation of the theory deals only with 
the counterfactual assertion of the probabilities of what values would be obtained if
suitable measurements were made.

However, strict instrumentalism faces severe problems (not least in quantum gravity); 
and the question arises therefore of whether it may not after all be possible to retain 
some 'realist flavour' in the theory by, for example, changing the logical structure with
which propositions about the values of physical quantities are handled. One of our 
claims is that this can indeed be done by introducing a certain topos perspective on the 
Kochen-Specker theorem. 



We will argue for this claim in Section 3.7. For the moment, we just remark that 
no-go theorems like that of Kochen and Specker depend upon the fact that the set of all
spectral projectors of $\mathcal{H}$ forms a non-Boolean, indeed non-distributive, lattice; suggesting
a non-Boolean, indeed non-distributive, 'quantum logic'. This alluring idea, originated 
by Birkhoff and von Neumann |^, has been greatly developed in various directions.^ But 
in this connection, the important point to stress for the purposes of this paper is that 
the logic associated with our topos-theoretic proposals is not non-distributive. On the 
contrary, any topos has an associated internal logical structure that is distributive. This 
retention of the distributive law marks a major departure from the dominant tradition 
of quantum logic stemming from Birkhoff and von Neumann. 
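
The failure of the distributive law in the lattice of subspaces can be seen already in two dimensions. The following Python sketch is a minimal model, assuming a two-dimensional real vector space in place of the Hilbert space; the encoding of subspaces as tagged tuples and the labelling of lines by slope are assumptions of the sketch (the vertical line is simply not represented).

```python
# Subspaces of a two-dimensional vector space: the zero subspace, lines
# through the origin (labelled here by slope), and the whole plane.
ZERO = ("zero",)
PLANE = ("plane",)

def line(slope):
    return ("line", slope)

def meet(a, b):
    """Intersection of two subspaces."""
    if a == b:
        return a
    if a == PLANE:
        return b
    if b == PLANE:
        return a
    return ZERO   # distinct lines, or anything with ZERO, meet in the origin

def join(a, b):
    """Linear span of two subspaces."""
    if a == b:
        return a
    if a == ZERO:
        return b
    if b == ZERO:
        return a
    return PLANE  # distinct lines, or anything with PLANE, span the plane

a, b, c = line(0), line(1), line(2)
lhs = meet(a, join(b, c))            # a meet (whole plane) = a
rhs = join(meet(a, b), meet(a, c))   # ZERO join ZERO = ZERO
print(lhs == rhs)                    # False: distributivity fails

# By contrast, subsets of a set (the classical case) always distribute.
A, B, C = {1, 2}, {2, 3}, {3, 4}
print(A & (B | C) == (A & B) | (A & C))  # True
```

Three distinct lines through the origin are enough to break the law, whereas intersection and union of subsets always satisfy it.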

On the other hand, our proposals do involve non-Boolean structure since the internal 
logic of a topos is 'intuitionistic', in the sense that the law of excluded middle may not 
hold (although for some toposes, such as the category of sets, it does apply) .0 

1.2 Challenges of Quantum Gravity 

The problem of realism becomes particularly acute in the case of quantum gravity. This 
field is notoriously problematic in comparison with other branches of theoretical physics, 
not just technically but also conceptually. In the first place, there is no clear agreement 
about what the aim of a quantum theory of gravity should be, apart from the broad goal 
of in some way unifying, or reconciling, quantum theory and general relativity. That 
these theories do indeed conflict is clear enough: general relativity is a highly successful 
theory of gravity and spacetime, which treats matter classically (both as a source of 
the gravitational field, and as influenced by it) and treats the structure of spacetime as 
dynamical; while quantum theory provides our successful theories of matter, and treats 
spacetime as a fixed, background structure. 

Much has been written about the conceptual problems that arise in quantum gravity; 
(for one recent survey, cf. |^). But in the present context it suffices to say that these 
are sufficiently severe to cause a number of workers in the field to question many of 
the basic ideas that are implicit in most, if not all, of the existing programmes. For 
example, there have been a number of suggestions that spatio-temporal ideas of classical 
general relativity such as topological spaces, continuum manifolds, space-time geometry, 
micro-causality, etc. are inapplicable in quantum gravity. 

More iconoclastically, one may doubt the applicability of quantum theory itself, 
notwithstanding the fact that all current research programmes in quantum gravity do 
adopt a more-or-less standard approach to quantum theory. In particular, as we shall 
discuss shortly, there is a danger of certain a priori, classical ideas about space and 
time being used unthinkingly in the very formulation of quantum theory; thus leading
to a type of category error when attempts are made to apply this theory to domains in
quantum gravity where such concepts may be inappropriate.

^Cf. Dalla Chiara and Giuntini's masterly recent survey Q. This survey includes recent developments
that generalize the basic correspondence between subspaces and propositions about values, so
as to treat so-called 'unsharp' ('operational') quantum physics; on this see also [Q and other papers in
this issue.

^Some intuitionistic structures also arise in the dominant 'non-distributive' tradition in quantum
logic; for example, in the Brouwer-Zadeh approach to unsharp quantum theory; cf. |q|.

1.3 Whence the Continuum? 

As an example of the adoption by quantum theory of certain problematic concepts, 
we will now consider the use of the continuum — i.e., of real and complex numbers — 
in the formulation of our physical theories in general. And having raised this topic, 
we shall describe in the next Subsection two natural alternative conceptions of space 
and time, which will involve the use of topos theory. (We give this discussion before 
introducing toposes in Section 2, since: (i) it is independent of the logical issues that 
will be emphasised in the rest of this paper; and accordingly, (ii) it can be understood 
without using details of the notion of a topos.) 

So let us ask: why do we use the continuum, i.e., the real numbers, in our physical 
theories? The three obvious answers are: (i) to be the values of physical quantities; (ii) 
to model space and time; and (iii) to be the values of probabilities. But let us pursue a 
little the question of what justifies these answers: we will discuss them in turn. 

• As to (i), the first point to recognize is of course that the whole edifice of physics,
both classical and quantum, depends upon applying calculus and its higher developments
(for example, functional analysis and differential geometry) to the values
of physical quantities. But in the face of this, one could still take the view that
the success of these physical theories only shows the 'instrumental utility' of the
continuum — and not that physical quantities really have real-number values. This
is not the place to enter the general philosophical debate between instrumentalist
and realist views of scientific theories; or even the more specific question of
whether an instrumentalist view about the continuum is committed to somehow
rewriting all our physical theories without use of $\mathbb{R}$: for example, in terms of rational
numbers (and if so, how this should be done!). Suffice it to say here that the issue
whether physical quantities have real-number values leads into the issue whether
space itself is modelled using $\mathbb{R}$. For not only is length one (obviously very important!)
quantity in physics; also, one main, if not compelling, reason for taking
other quantities to have real-number values is that results of measuring them can
apparently always be reduced to the position of some sort of pointer in space — and
space is modelled using $\mathbb{R}$.

We note that the formalism of elementary wave mechanics affords a good example 
of an a priori adoption of the idea of a continuum model of space: indeed, the $x$
in $\psi(x)$ represents space, and in the theory this observable is modelled as having
a continuous spectrum; in turn, this requires the underlying Hilbert space to be 
defined over the real or complex field. 

• So we turn to (ii): why should space be modelled using $\mathbb{R}$? More specifically, we
ask, in the light of our remarks about (i): Can any reason be given apart from the
(admittedly, immense) 'instrumental utility' of doing so, in the physical theories we
have so far developed? In short, our answer is No. In particular, we believe there
is no good a priori reason why space should be a continuum; similarly, mutatis
mutandis for time. But then the crucial question arises of how this possibility of
a non-continuum space should be reflected in our basic theories, in particular in
quantum theory itself, which is one of the central ingredients of quantum gravity.

• As to (iii), why should probabilities be real numbers? Admittedly, if probability
is construed in terms of the relative frequency of a result in a sequence of measurements,
then real numbers do arise as the limits of infinite sequences of finite
relative frequencies (which are all rational numbers). But this limiting relative
frequency interpretation of probability is disputable. In particular, it seems problematic
in the quantum gravity regime where standard ideas of space and time
might break down in such a way that the idea of spatial or temporal 'ensembles'
is inappropriate.

On the other hand, for the other main interpretations of probability — subjective,
logical, or propensity — there seems to us to be no compelling a priori reason why
probabilities should be real numbers. For subjective probability (roughly: what
a rational agent's minimum acceptable odds, for betting on a proposition, are or
should be): many authors point out that the use of $\mathbb{R}$ as the values of probabilities
is questionable, whether as an idealization of the psychological facts, or as a norm
of rationality. For the logical and propensity interpretations — which are arguably
more likely to be appropriate for the quantum gravity regime — the use of $\mathbb{R}$ as
the values of probabilities is less discussed. But again, we see no a priori reason
for $\mathbb{R}$.^ Indeed, we would claim that while no doubt in some cases, one 'degree of
entailment' or 'propensity' is 'larger' than another, it also seems possible that in
other cases two degrees of entailment, or two propensities, might be incomparable,
so that the codomain of the probability-function should be, not a linear order, but
some sort of partially ordered set (equipped with a sum-operation, so as to make
sense of the additivity axiom for probabilities). Once again this suggests that a
fairly radical revision of quantum theory itself might be in order.



1.4 Alternative Conceptions of Spacetime 

Scepticism about the use of the continuum in present-day physical theories prompts 
one to consider alternative conceptions of space and time. We now briefly sketch
two such conceptions. Both involve topos theory, and indeed raise the idea — even more 
iconoclastic than scepticism about the continuum — that the use of set theory itself may 
be inappropriate for modelling space and time. 



^It seems to us that in the literature, the principal 'justification' given for $\mathbb{R}$ is the mathematical
desideratum of securing a uniqueness claim in a representation theorem about axiom systems for qualitative
probability; the claim is secured by imposing a continuity axiom that excludes number-fields
other than $\mathbb{R}$ as the codomain of the representing probability-function.



1.4.1 From points to regions In standard general relativity — and, indeed, in all 
classical physics — space (and similarly time) is modelled by a set, and the elements of 
that set are viewed as corresponding to points in space. However, if one is 'suspicious 
of points' — whether of spacetime, of space or of time (i.e. instants) — it is natural to
try to construct a theory based on 'regions' as the primary concept; with 'points' — if
they exist at all — being relegated to a secondary role in which they are determined by
the 'regions' in some way (rather than regions being sets of points, as in the standard
theories).^

So far as we know, the first rigorous development of this idea was made in the 
context of foundational studies in the 1920s and 1930s, by authors such as Tarski. The 
idea was to write down axioms for regions from which one could construct points, with 
the properties they enjoyed in some familiar theory such as three-dimensional Euclidean 
geometry. For example, the points were constructed in terms of sequences of regions,
each contained in its predecessor, and whose 'widths' tended to zero (more precisely,
the point might be identified with an equivalence class of such sequences). The success
of such a construction was embodied in a representation theorem, that any model of
the given axiom system for regions was isomorphic to, for example, $\mathbb{R}^3$ equipped with
a structured family of subsets, which corresponded to the axiom system's regions. In
this sense, this line of work was 'conservative': one recovered the familiar theory with
its points, from a new axiom system with regions as primitives.^

But use of regions in place of points need not be 'conservative': one can imagine axiom 
systems for regions, whose models (or some of whose models) do not contain anything 
corresponding to points of which the regions are composed. Indeed, for any topological
space $Z$, the family of all open sets can have algebraic operations of 'conjunction',
'disjunction' and 'negation' defined on them by: $O_1 \wedge O_2 := O_1 \cap O_2$; $O_1 \vee O_2 := O_1 \cup O_2$;
and $\neg O := \mathrm{int}(Z - O)$; and with these operations, the open sets form a complete Heyting
algebra, also known as a locale. Here, a Heyting algebra is defined to be a distributive
lattice $H$, with null and unit elements, that is relatively complemented, which means
that to any pair $S_1, S_2$ in $H$, there exists an element $S_1 \Rightarrow S_2$ of $H$ with the property
that, for all $S \in H$,

$$S \leq (S_1 \Rightarrow S_2) \text{ if and only if } S \wedge S_1 \leq S_2. \tag{1.3}$$

Heyting algebras are thus a generalization of Boolean algebras; they need not obey the
law of excluded middle, and so provide natural algebraic structures for intuitionistic
logic. A Heyting algebra is said to be complete if every family of elements has a least
upper bound. Summing up: the open sets of any topological space form a Heyting 
algebra, when partially ordered by set-inclusion; indeed a complete Heyting algebra (a 
locale), since arbitrary unions of open sets are open. 
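
The Heyting structure on open sets can be checked by hand on a very small space. The sketch below is only an illustration: the choice of the two-point Sierpinski space and the function names are assumptions, and the relative pseudo-complement is computed by the standard formula "interior of (complement of the first set) union the second set".

```python
# Open sets of the two-point Sierpinski space Z = {0, 1}.
Z = frozenset({0, 1})
OPENS = [frozenset(), frozenset({1}), Z]

def interior(s):
    """Largest open set contained in s: the union of all opens inside s."""
    result = frozenset()
    for o in OPENS:
        if o <= s:
            result |= o
    return result

def meet(a, b):
    return a & b

def join(a, b):
    return a | b

def neg(a):
    """Negation of an open set: the interior of its complement."""
    return interior(Z - a)

def implies(a, b):
    """Relative pseudo-complement of a with respect to b."""
    return interior((Z - a) | b)

# The defining property of the relative pseudo-complement holds for all opens.
adjunction_holds = all(
    (s <= implies(s1, s2)) == (meet(s, s1) <= s2)
    for s in OPENS for s1 in OPENS for s2 in OPENS
)
print(adjunction_holds)        # True

U = frozenset({1})
print(join(U, neg(U)) == Z)    # False: excluded middle fails for U
print(neg(neg(U)) == U)        # False: double negation is not the identity
```

Here the negation of the open set {1} is empty, because the complement {0} has no non-empty open subset.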

However, it turns out that not every locale is isomorphic to the Heyting algebra
of open sets of some topological space; and in this sense, the theory of regions given
by the definition of a locale is not 'conservative' — it genuinely generalizes the idea of
a topological space, allowing families of regions that are not composed of underlying
points.

^For time, the natural word is 'intervals', not 'regions'; but we shall use only 'regions', though
the discussion to follow applies equally to the one-dimensional case — and so to time — as it does to
higher-dimensional cases, and so to space and spacetime.

^From the pure mathematical point of view, Stone's representation theorem for Boolean algebras of
1936 was a landmark for this sort of work.

A far-reaching generalisation of this idea is given by topos theory. As we shall see in 
Section 2.2: (i) in any topos, there is an analogue of the set-theoretic idea of the family 
of subsets of a given set — called the family of subobjects of a given object X; (ii) for 
any object X in any topos, the family of subobjects of X is a locale. 

1.4.2 Synthetic Differential Geometry Recent decades have seen a revival of the 
idea of infinitesimals. Though the idea was heuristically valuable in the discovery and 
development of the calculus, it was expunged in the nineteenth-century rigorization of 
analysis by authors such as Cauchy and Weierstrass — for surely no sense could be made
of the idea of nilpotent real numbers, i.e., $d$ such that $d^2 = 0$, apart from the trivial
case $d = 0$? But it turns out that sense can be made of this: indeed in two somewhat
different ways.

In the first approach, called 'non-standard analysis', every infinitesimal (i.e., every
nilpotent $d \neq 0$) has a reciprocal, so that there are different infinite numbers corresponding
to the different infinitesimals. There were attempts in the 1970s to apply this idea to
quantum field theory: in particular, it was shown how the different orders of ultra-violet 
divergences that arise correspond to different types of infinite number in the sense of 
non-standard analysis [llO[ . 

However, we wish here to focus on the alternative approach in which we have infinitesimals,
but without the corresponding infinite numbers. It transpires that this is
possible provided we work within the context of a topos; for example, a careful study
of the proof that the only real number $d$ such that $d^2 = 0$ is $0$, shows that it involves
the principle of excluded middle, which in general does not hold in the characteristic 



intuitionistic logic of a topos [|TI 



So in this second approach, called 'synthetic differential geometry', infinitesimals 
do not have reciprocals. Applying this approach to elementary real analysis, 'all goes
smoothly'! For example, all functions are differentiable, with the linear approximation
familiar from Taylor's theorem, $f(x + d) = f(x) + d f'(x)$, being exact. And in the
context of synthetic differential geometry, a tangent vector on a manifold $\mathcal{M}$ is a map
(more precisely, a 'morphism') from the object $D := \{d \mid d^2 = 0\}$ to $\mathcal{M}$.

Furthermore, one can go on to apply this approach to the higher developments of 
calculus. Indeed, this has already been done by mathematicians; but we shall not try to 
report, let alone sketch, any such applications. 
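
The exactness of the linear approximation $f(x + d) = f(x) + d f'(x)$ can be imitated, for polynomial functions, with the classical algebra of 'dual numbers' $a + b\,d$ subject to $d^2 = 0$. This is only an algebraic stand-in and an assumption of the sketch: dual numbers are built in ordinary set theory and are not a model of synthetic differential geometry, where the infinitesimals live in a topos and the statement covers all functions. The class `Dual` and the sample polynomial are hypothetical names chosen for illustration.

```python
class Dual:
    """Numbers a + b*d with d*d = 0 (a stand-in for a nilpotent infinitesimal)."""

    def __init__(self, real, inf=0):
        self.real = real   # ordinary part a
        self.inf = inf     # coefficient b of the nilpotent element d

    @staticmethod
    def lift(value):
        return value if isinstance(value, Dual) else Dual(value)

    def __add__(self, other):
        other = Dual.lift(other)
        return Dual(self.real + other.real, self.inf + other.inf)

    __radd__ = __add__

    def __mul__(self, other):
        other = Dual.lift(other)
        # (a + b d)(c + e d) = ac + (ae + bc) d, because d*d = 0
        return Dual(self.real * other.real,
                    self.real * other.inf + self.inf * other.real)

    __rmul__ = __mul__

    def __eq__(self, other):
        other = Dual.lift(other)
        return self.real == other.real and self.inf == other.inf

    def __repr__(self):
        return f"Dual({self.real}, {self.inf})"

d = Dual(0, 1)

def f(x):
    return x * x * x + 2 * x     # f(x) = x^3 + 2x

def f_prime(x):
    return 3 * x * x + 2         # derivative, computed by hand

x = 3
print(d * d == 0)                           # True: d is nilpotent
print(f(x + d) == f(x) + d * f_prime(x))    # True: no higher-order remainder
```

The second check succeeds with exact integer arithmetic because every term containing $d^2$ vanishes identically.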

One crucial question is whether or not there are any physically natural applications 
of synthetic differential geometry to physics; (as against 'merely rewriting' standard 
theories in synthetic terms). We will claim in Section 3 that precisely such an application 
arises in the consistent-histories formulation of quantum theory in the context of a 
continuous time variable. 



2 Presheaves and Related Notions from Topos Theory

There are various approaches to the notion of a topos but we will focus here on one 
that emphasises the underlying logical structure (as befits a Festschrift for Marisa Dalla
Chiara!). Also, to keep the discussion simple, we will not develop the full definition of a
topos — which our discussions of applications in Section 3 will in fact not need. Indeed, 
in this Section we will only discuss one, albeit crucial, clause of the definition of a topos: 
the requirement that a topos contain a 'subobject classifier'. This is a generalization of 
the idea, familiar in set-theory, of characteristic functions. The generalization will turn 
out to have a particularly interesting logical structure in the case of the kind of topos 
to which our discussion in Section 3 is confined: a topos of presheaves. 

A topos is a particular type of category. Very roughly, it is a category that behaves 
much like the category of sets; indeed, this category, which we will call Set, is itself 
a topos. So we will begin by recalling a few fundamental concepts that apply to any 
category (Section 2.1); then we will discuss the idea of a subobject classifier (Section 
2.2); and finally, the ideas of a presheaf, and a topos of presheaves (Section 2.3). 

2.1 Categories 

We recall that a category consists of a collection of objects, and a collection of arrows
(or morphisms), with the following three properties. (1) Each arrow $f$ is associated
with a pair of objects, known as its domain ($\mathrm{dom}\, f$) and its codomain ($\mathrm{cod}\, f$), and is
written in the form $f : B \to A$ where $B = \mathrm{dom}\, f$ and $A = \mathrm{cod}\, f$. (2) Given two arrows
$f : B \to A$ and $g : C \to B$ (so that the codomain of $g$ is equal to the domain of $f$), there
is a composite arrow $f \circ g : C \to A$; and this composition of arrows obeys the associative
law. (3) Each object $A$ has an identity arrow, $\mathrm{id}_A : A \to A$, with the properties that for
all $f : B \to A$ and all $g : A \to C$, $\mathrm{id}_A \circ f = f$ and $g \circ \mathrm{id}_A = g$.

We have already mentioned the prototype category (indeed, topos) Set, in which 
the objects are sets and the arrows are ordinary functions between them (set-maps). In 
many categories, the objects are sets equipped with some type of additional structure, 
and the arrows are functions that preserve this structure (hence the word 'morphism'). 
An obvious algebraic example is the category of groups, where an object is a group,
and an arrow $f : G_1 \to G_2$ is a group homomorphism from $G_1$ to $G_2$. (More generally,
one often defines one category in terms of another; and in such a case, there is often 
only one obvious way of defining composition and identity maps for the new category.) 
However, a category need not have 'structured sets' as its objects. An example (which 
will be prominent in Section 3) is given by any partially-ordered set ('poset') $\mathcal{P}$. It can
be regarded as a category in which (i) the objects are the elements of $\mathcal{P}$; and (ii) if
$p, q \in \mathcal{P}$, an arrow from $p$ to $q$ is defined to exist if, and only if, $p \leq q$ in the poset
structure. Thus, in a poset regarded as a category, there is at most one arrow between
any pair of objects $p, q \in \mathcal{P}$.
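
The three clauses of the definition can be verified mechanically for a small poset regarded as a category. In the sketch below, the poset (the divisors of 12 ordered by divisibility) and the encoding of an arrow as a pair (domain, codomain) are assumptions chosen for illustration.

```python
from itertools import product

# Divisors of 12, ordered by divisibility: p <= q means "p divides q".
ELEMENTS = [1, 2, 3, 4, 6, 12]

def leq(p, q):
    return q % p == 0

# An arrow p -> q exists exactly when p <= q; it is recorded as the pair (p, q).
ARROWS = [(p, q) for p, q in product(ELEMENTS, repeat=2) if leq(p, q)]

def compose(f, g):
    """Composite f o g of g : C -> B and f : B -> A, as in clause (2)."""
    c, b = g
    b2, a = f
    if b != b2:
        raise ValueError("the codomain of g must equal the domain of f")
    return (c, a)

def identity(a):
    return (a, a)

# Composition is well defined (the order is transitive).
closed = all(compose(f, g) in ARROWS
             for f in ARROWS for g in ARROWS if g[1] == f[0])

# Identity laws of clause (3).
identities = all(compose(identity(f[1]), f) == f and compose(f, identity(f[0])) == f
                 for f in ARROWS)

# Associativity of clause (2).
associative = all(compose(f, compose(g, h)) == compose(compose(f, g), h)
                  for f in ARROWS for g in ARROWS for h in ARROWS
                  if h[1] == g[0] and g[1] == f[0])

print(closed, identities, associative)  # True True True
```

Because an arrow is determined by its endpoints, there is at most one arrow between any two objects, and associativity holds automatically.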

In any category, an object $T$ is called a terminal (resp. initial) object if for every
object $A$ there is exactly one arrow $f : A \to T$ (resp. $f : T \to A$). Any two terminal
(resp. initial) objects are isomorphic.^ So we normally fix on one such object; and we
write 'the' terminal (resp. initial) object as $1$ (resp. $0$). An arrow $1 \to A$ is called a
point, or a global element, of A. For example, applying these definitions to our example 
Set of a category, we find that (i) each singleton set is a terminal object; (ii) the empty 
set is initial; and (iii) the points of A give a 'listing' of the elements of A. 
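
For finite sets, these three facts about Set can be confirmed by brute-force enumeration of functions. The sketch below assumes functions are encoded as Python dictionaries; the helper `all_functions` and the sample sets are hypothetical names introduced for the example.

```python
from itertools import product

def all_functions(domain, codomain):
    """Every function domain -> codomain, each represented as a dict."""
    domain = list(domain)
    codomain = list(codomain)
    return [dict(zip(domain, images))
            for images in product(codomain, repeat=len(domain))]

A = {"a", "b", "c"}
ONE = {"*"}      # a singleton set
EMPTY = set()

# (i) exactly one function from A into a singleton: the singleton is terminal.
print(len(all_functions(A, ONE)))      # 1

# (ii) exactly one function from the empty set into A: the empty set is initial.
print(len(all_functions(EMPTY, A)))    # 1

# (iii) functions from the singleton into A list the elements of A.
points = all_functions(ONE, A)
print({p["*"] for p in points} == A)   # True
```

The single function out of the empty set is the empty function, which appears here as the empty dictionary.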

2.2 Toposes and Subobject Classifiers 

We turn now to introducing a very special kind of category called a 'topos'. As we 
said at the beginning of this Section, we will discuss only one clause of the definition 
of a topos: the requirement that a topos contain a generalization of the set-theoretic 
concept of a characteristic function; this generalization is closely related to what is called 
a 'subobject classifier'. 

Recall that characteristic functions classify whether an element $x$ is in a given subset
$A$ of a set $X$ by mapping $x$ to $1$ if $x \in A$, and to $0$ if $x \notin A$. More fully: for any set
$X$, and any subset $A \subseteq X$, there is a characteristic function $\chi_A : X \to \{0, 1\}$, with
$\chi_A(x) = 1$ or $0$ according as $x \in A$ or $x \notin A$. One thinks of $\{0, 1\}$ as the truth-values;
and $\chi_A$ classifies the various $x$ for the set-theoretically natural question, "$x \in A$?".
Furthermore, the structure of Set — the category of sets — secures the existence of this
set of truth-values and the various functions $\chi_A$: in particular, $\{0, 1\}$ is itself a set, i.e.
an object in the category Set, and for each $A, X$ with $A \subseteq X$, $\chi_A$ is an arrow from $X$
to $\{0, 1\}$.
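
The correspondence between subsets of $X$ and functions from $X$ to $\{0, 1\}$ can be checked exhaustively for a small set. In this sketch the three-element set and the function names `chi` and `subset_of` are assumptions made for illustration.

```python
from itertools import combinations, product

X = ["a", "b", "c"]

def chi(A):
    """Characteristic function of the subset A of X, as a dict X -> {0, 1}."""
    return {x: 1 if x in A else 0 for x in X}

def subset_of(chi_map):
    """Recover the subset classified by a function X -> {0, 1}."""
    return frozenset(x for x in X if chi_map[x] == 1)

subsets = [frozenset(c) for r in range(len(X) + 1) for c in combinations(X, r)]
functions = [dict(zip(X, bits)) for bits in product([0, 1], repeat=len(X))]

# The two passages are mutually inverse, so subsets and functions correspond.
print(all(subset_of(chi(A)) == A for A in subsets))      # True
print(all(chi(subset_of(f)) == f for f in functions))    # True
print(len(subsets), len(functions))                      # 8 8
```

It is this one-to-one correspondence that the notion of a subobject classifier generalizes to other categories.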

It is possible to formulate this 'classifying action' of the various $\chi_A$ in general
category-theoretic terms, so as to give a fruitful generalization. For the purposes of 
this paper, the main ideas are as follows. 

1. In any category, one can define a categorial analogue of the set-theoretic idea of
subset: it is called a 'subobject'. More precisely, one generalizes the idea that a
subset $A$ of $X$ has a preferred injective (i.e., one-to-one) map $A \to X$ sending
$x \in A$ to $x \in X$. For category theory provides a generalization of injective maps,
called 'monic arrows' or 'monics'; so that in any category one defines a subobject
of any object $X$ to be a monic with codomain $X$.

2. Any topos is required to have an analogue, written $\Omega$, of the set $\{0, 1\}$ of truth-values.
That is to say: just as $\{0, 1\}$ is itself a set — i.e., an object in the category
Set of sets — so also in any topos, $\Omega$ is an object in the topos. And just as the set
of subsets of a given set $X$ corresponds to the set of characteristic functions from
$X$ to $\{0, 1\}$; so also in any topos, there is a one-to-one correspondence
between subobjects of an object $X$, and arrows from $X$ to $\Omega$.

^Two objects $A$ and $B$ in a category are said to be isomorphic if there exist arrows $f : A \to B$ and
$g : B \to A$ such that $f \circ g = \mathrm{id}_B$ and $g \circ f = \mathrm{id}_A$.

3. In a topos, $\Omega$ acts as an object of generalized truth-values, just as $\{0, 1\}$ does in
set-theory (though $\Omega$ typically has more than two global elements). Intuitively,
the elements of $\Omega$ are the answers to a natural 'multiple-choice question' about the
objects in the topos, just as "$x \in X$?" is natural for sets. An example:

• A set $X$ equipped with a given function $\alpha : X \to X$ is called an endomap,
written $(X; \alpha)$; and the family of all endomaps forms a category — indeed, a
topos — when one defines an arrow from $(X; \alpha)$ to $(Y; \beta)$ to be an ordinary
set-function $f$ between the underlying sets, from $X$ to $Y$, that preserves the
endomap structure, i.e., $f \circ \alpha = \beta \circ f$.

Applying the definition of a subobject, it turns out that a subobject of $(X; \alpha)$
is a subset of $X$ that is closed under $\alpha$, equipped with the restriction of $\alpha$:
i.e., a subobject is $(Z, \alpha|_Z)$, with $Z \subseteq X$ and such that $\alpha(Z) \subseteq Z$. So a
natural question, given $x \in X$ and a subendomap $(Z, \alpha|_Z)$, is: "How many
iterations of $\alpha$ are needed to send $x$ (or rather its descendant, $\alpha(x)$ or $\alpha^2(x)$
or $\alpha^3(x)$ . . . ) into $Z$?" The possible answers are '0' (i.e., $x \in Z$), '1', '2', . . . ,
and 'infinity' (i.e., the descendants never enter $Z$); and if the answer for $x$ is
some natural number $N \geq 1$ (resp. 0, infinity), then the answer for $\alpha(x)$ is $N - 1$
(resp. 0, infinity). So the possible answers can be presented as an endomap,
with the elements of the base-set labelled as '0', '1', '2', ..., and '$\infty$', and with
the map $\alpha$ acting as follows: $\alpha : N \mapsto N - 1$ for $N = 1, 2, \ldots$, and $\alpha : 0 \mapsto 0$,
$\alpha : \infty \mapsto \infty$.

And it turns out that this endomap is exactly the object $\Omega$ in the category of
endomaps! Recall that in any topos $\Omega$ is an object in the topos, so that here
$\Omega$ must itself be an endomap, a set equipped with a function to itself.
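
The classifying map in this example can be computed directly for a finite endomap. The following Python sketch makes several assumptions for the sake of illustration: the six-element set, the particular map `alpha`, the closed subset `Z`, and the use of floating-point infinity for the answer 'infinity' are all choices of the example, not of the text.

```python
INF = float("inf")

def omega_map(n):
    """The endomap on the answers {0, 1, 2, ..., INF}: N -> N - 1, with 0 and INF fixed."""
    if n == 0 or n == INF:
        return n
    return n - 1

def classify(X, alpha, Z):
    """For each x in X, the number of iterations of alpha needed to enter Z."""
    chi = {}
    for x in X:
        y, steps, seen = x, 0, set()
        # On a finite set, an orbit that revisits a point without reaching Z never will.
        while y not in Z and y not in seen:
            seen.add(y)
            y = alpha[y]
            steps += 1
        chi[x] = steps if y in Z else INF
    return chi

X = [1, 2, 3, 4, 5, 6]
alpha = {1: 2, 2: 3, 3: 3, 4: 5, 5: 4, 6: 1}
Z = {3}
assert all(alpha[z] in Z for z in Z)   # Z is closed under alpha

chi = classify(X, alpha, Z)
print(chi)   # {1: 2, 2: 1, 3: 0, 4: inf, 5: inf, 6: 3}

# chi preserves the endomap structure: chi o alpha = omega_map o chi.
print(all(chi[alpha[x]] == omega_map(chi[x]) for x in X))   # True

# The subobject is recovered as the set of points whose answer is 0.
print({x for x in X if chi[x] == 0} == Z)                   # True
```

The middle check is the statement that the classifying map is itself an arrow in the category of endomaps.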

4. This example suggests that $\Omega$ is fixed by the structure of the topos concerned. And
indeed, this is so in the precise sense that, although the clause in the definition of a
topos that postulates the existence of $\Omega$ characterizes $\Omega$ solely in terms of conditions
on the topos' objects and arrows, $\Omega$ is provably unique (up to isomorphism).

Furthermore, in any topos, $\Omega$ has a natural logical structure. More exactly, $\Omega$
has the internal structure of a Heyting algebra object: the algebraic structure
appropriate for intuitionistic logic, mentioned in Section 1.4.1. In addition, in any
topos, the collection of subobjects of any given object $X$ is a complete Heyting
algebra (a locale). We shall see this sort of Heyting algebra structure in more
detail in the next Subsection, for the case that concerns us — presheaves. For the
moment we note only the general point, valid for any topos, that because $\Omega$ is fixed
by the structure of the topos concerned, and has a natural Heyting structure, a
major traditional objection to multi-valued logics — that the exact structure of the
logic, or associated algebras, seems arbitrary — does not apply here.



2.3 Toposes of Presheaves 

In preparation for the applications in Section 3, we turn now to the theory of presheaves:
more precisely, the theory of presheaves on an arbitrary 'small' category $\mathcal{C}$ (the
qualification 'small' means that the collection of objects in $\mathcal{C}$ is a genuine set, as is
the collection of all of $\mathcal{C}$'s arrows).



To make the necessary definition we recall the idea of a 'functor' between a pair
of categories $\mathcal{C}$ and $\mathcal{D}$. Broadly speaking, this is an arrow-preserving function from one
category to the other. The precise definition is as follows.

Definition 2.1 

• A covariant functor $F$ from a category $\mathcal{C}$ to a category $\mathcal{D}$ is a function that assigns

1. to each $\mathcal{C}$-object $A$, a $\mathcal{D}$-object $F(A)$;

2. to each $\mathcal{C}$-arrow $f : B \to A$, a $\mathcal{D}$-arrow $F(f) : F(B) \to F(A)$ such that
$F(\mathrm{id}_A) = \mathrm{id}_{F(A)}$; and, if $g : C \to B$ and $f : B \to A$, then

$$F(f \circ g) = F(f) \circ F(g). \tag{2.1}$$

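A presheaf (also known as a varying set) on the category $\mathcal{C}$ is defined to be a covariant
functor $\mathbf{X}$ from the category $\mathcal{C}$ to the category 'Set' of normal sets. We want to make
the collection of presheaves on $\mathcal{C}$ into a category, and therefore we need to define what is
meant by an 'arrow' between two presheaves $\mathbf{X}$ and $\mathbf{Y}$. The intuitive idea is that such
an arrow from $\mathbf{X}$ to $\mathbf{Y}$ must give a 'picture' of $\mathbf{X}$ within $\mathbf{Y}$. Formally, such an arrow
is defined to be a natural transformation $N : \mathbf{X} \to \mathbf{Y}$, by which is meant a family of
maps (called the components of $N$) $N_A : \mathbf{X}(A) \to \mathbf{Y}(A)$, $A$ an object in $\mathcal{C}$, such that
if $f : A \to B$ is an arrow in $\mathcal{C}$, then the composite map
$\mathbf{X}(A) \xrightarrow{N_A} \mathbf{Y}(A) \xrightarrow{\mathbf{Y}(f)} \mathbf{Y}(B)$ is equal to
$\mathbf{X}(A) \xrightarrow{\mathbf{X}(f)} \mathbf{X}(B) \xrightarrow{N_B} \mathbf{Y}(B)$.
In other words, we have the commutative diagram

$$
\begin{array}{ccc}
\mathbf{X}(A) & \xrightarrow{\ \mathbf{X}(f)\ } & \mathbf{X}(B) \\
{\scriptstyle N_A}\downarrow & & \downarrow{\scriptstyle N_B} \\
\mathbf{Y}(A) & \xrightarrow{\ \mathbf{Y}(f)\ } & \mathbf{Y}(B)
\end{array}
\tag{2.2}
$$

The category of presheaves on $\mathcal{C}$ equipped with these arrows is denoted $\mathrm{Set}^{\mathcal{C}}$.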
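As a concrete check of the naturality condition, here is a minimal sketch for a hypothetical base category with two objects `A`, `B` and a single non-identity arrow `f : A -> B`. All sets, maps and names below are illustrative assumptions, not taken from the text. A presheaf is stored as a dictionary of sets plus a dictionary of functions, and a candidate family of components is tested against the commuting square.

```python
# A tiny base category: two objects and one non-identity arrow f : A -> B.
arrows = {"f": ("A", "B")}

# Presheaf X: a set at each stage and a function X(f) for the arrow f.
X_sets = {"A": {1, 2, 3}, "B": {"u", "v"}}
X_maps = {"f": {1: "u", 2: "u", 3: "v"}}

# Presheaf Y, given by its function Y(f) from {"even", "odd"} to {"s", "t"}.
Y_maps = {"f": {"even": "s", "odd": "t"}}

def is_natural(N, X_sets, X_maps, Y_maps, arrows):
    """Check N_B(X(f)(x)) == Y(f)(N_A(x)) for every arrow f : A -> B and every x in X(A)."""
    for name, (src, dst) in arrows.items():
        for x in X_sets[src]:
            if N[dst][X_maps[name][x]] != Y_maps[name][N[src][x]]:
                return False
    return True

# A family of components that makes the square commute ...
N_good = {"A": {1: "odd", 2: "odd", 3: "even"}, "B": {"u": "t", "v": "s"}}
# ... and one that does not (the elements 1 and 2 are identified by X(f) but separated by N_A).
N_bad = {"A": {1: "odd", 2: "even", 3: "even"}, "B": {"u": "t", "v": "s"}}

good = is_natural(N_good, X_sets, X_maps, Y_maps, arrows)
bad = is_natural(N_bad, X_sets, X_maps, Y_maps, arrows)
```

The failing case shows why naturality is a genuine constraint: the components cannot be chosen independently at each stage.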

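We say that $\mathbf{K}$ is a subobject of $\mathbf{X}$ if there is an arrow in the category of presheaves
(i.e., a natural transformation) $i : \mathbf{K} \to \mathbf{X}$ with the property that, for each $A$, the
component map $i_A : \mathbf{K}(A) \to \mathbf{X}(A)$ is a subset embedding, i.e., $\mathbf{K}(A) \subseteq \mathbf{X}(A)$. Thus,
if $f : A \to B$ is any arrow in $\mathcal{C}$, we get the analogue of the commutative diagram Eq. (2.2):

$$
\begin{array}{ccc}
\mathbf{K}(A) & \xrightarrow{\ \mathbf{K}(f)\ } & \mathbf{K}(B) \\
\downarrow & & \downarrow \\
\mathbf{X}(A) & \xrightarrow{\ \mathbf{X}(f)\ } & \mathbf{X}(B)
\end{array}
\tag{2.3}
$$

where, once again, the vertical arrows are subset inclusions.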

The category of presheaves on $\mathcal{C}$, $\mathrm{Set}^{\mathcal{C}}$, forms a topos. As we have said, we will not
need the full definition of a topos; but we do need the idea that a topos has a subobject
classifier $\Omega$, to which we now turn.



2.3.1 Sieves and the Subobject Classifier in a Topos of Presheaves

Among the key concepts in presheaf theory, and something of particular importance for this
paper, is that of a 'sieve', which plays a central role in the construction of the subobject
classifier in the topos of presheaves on a category $\mathcal{C}$.

A sieve on an object $A$ in $\mathcal{C}$ is defined to be a collection $S$ of arrows $f : A \to B$ in $\mathcal{C}$
with the property that if $f : A \to B$ belongs to $S$, and if $g : B \to C$ is any arrow, then
$g \circ f : A \to C$ also belongs to $S$. In the simple case where $\mathcal{C}$ is a poset, a sieve on $p \in \mathcal{C}$
is any subset $S$ of $\mathcal{C}$ such that if $r \in S$ then (i) $p \leq r$, and (ii) $r' \in S$ for all $r \leq r'$; in
other words, a sieve is nothing but an upper set in the poset.

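The presheaf $\Omega : \mathcal{C} \to \mathrm{Set}$ is now defined as follows. If $A$ is an object in $\mathcal{C}$, then $\Omega(A)$
is defined to be the set of all sieves on $A$; and if $f : A \to B$, then $\Omega(f) : \Omega(A) \to \Omega(B)$
is defined as

$$\Omega(f)(S) := \{h : B \to C \mid h \circ f \in S\} \tag{2.4}$$

for all $S \in \Omega(A)$.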

For our purposes in what follows, it is important to note that if $S$ is a sieve on $A$,
and if $f : A \to B$ belongs to $S$, then from the defining property of a sieve we have

$$\Omega(f)(S) := \{h : B \to C \mid h \circ f \in S\} = \{h : B \to C\} =: \uparrow\! B \tag{2.5}$$

where $\uparrow\! B$ denotes the principal sieve on $B$, defined to be the set of all arrows in $\mathcal{C}$ whose
domain is $B$.

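If $\mathcal{C}$ is a poset, the associated operation on sieves corresponds to a family of maps
$\Omega_{qp} : \Omega_p \to \Omega_q$ (where $\Omega_p$ denotes the set of all sieves on $p$ in the poset) defined by
$\Omega_{qp} = \Omega(i_{pq})$ if $i_{pq} : p \to q$ (i.e., $p \leq q$). It is straightforward to check from
Eq. (2.4) that if $S \in \Omega_p$, then

$$\Omega_{qp}(S) := \uparrow\! q \cap S \tag{2.6}$$

where $\uparrow\! q := \{r \in \mathcal{C} \mid q \leq r\}$.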
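For a finite poset the sieves and the action of $\Omega$ on arrows can be enumerated directly. The sketch below uses a hypothetical four-element 'diamond' poset (an assumption made purely for illustration). It lists the sieves on each element and computes the pushed-forward sieve of Eq. (2.4) for an arrow $p \to q$: in a poset, the arrows out of $q$ whose composite with $p \to q$ lies in $S$ are just the elements of $S$ lying above $q$.

```python
from itertools import combinations

# Hypothetical diamond poset: a <= b, a <= c, b <= d, c <= d.
elements = ["a", "b", "c", "d"]
leq = {(x, x) for x in elements} | {
    ("a", "b"), ("a", "c"), ("b", "d"), ("c", "d"), ("a", "d")
}

def up(p):
    """The principal sieve on p: all elements r with p <= r."""
    return frozenset(r for r in elements if (p, r) in leq)

def is_sieve(S, p):
    """S is a sieve on p if it lies above p and is closed upwards."""
    return S <= up(p) and all(up(r) <= S for r in S)

def sieves(p):
    """Enumerate all sieves on p by brute force over subsets of up(p)."""
    cand = sorted(up(p))
    subsets = (frozenset(c) for k in range(len(cand) + 1) for c in combinations(cand, k))
    return [S for S in subsets if is_sieve(S, p)]

def push(S, q):
    """Omega(i_pq)(S) for a sieve S on p with p <= q, computed from Eq. (2.4)."""
    return frozenset(S & up(q))

counts = {p: len(sieves(p)) for p in elements}
S = frozenset({"b", "d"})           # a sieve on a that contains the arrow a -> b
pushed_to_b = push(S, "b")          # equals the principal sieve on b, as in Eq. (2.5)
pushed_to_c = push(S, "c")          # the arrow a -> c is not in S, so the result is smaller
```

The counts show that the number of generalised truth values varies from stage to stage, which is the sense in which the truth values are contextual.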

A crucial property of sieves is that the set $\Omega(A)$ of sieves on $A$ has the structure
of a Heyting algebra. Recall from Section 1.3.1 that this is defined to be a distributive
lattice, with null and unit elements, that is relatively complemented, which means that
for any pair $S_1, S_2$ in $\Omega(A)$, there exists an element $S_1 \Rightarrow S_2$ of $\Omega(A)$ with the property
that, for all $S \in \Omega(A)$,

$$S \leq (S_1 \Rightarrow S_2) \ \text{ if and only if } \ S \wedge S_1 \leq S_2. \tag{2.7}$$

Specifically, $\Omega(A)$ is a Heyting algebra where the unit element $1_{\Omega(A)}$ in $\Omega(A)$ is the
principal sieve $\uparrow\! A$, and the null element $0_{\Omega(A)}$ is the empty sieve $\emptyset$. The partial ordering
in $\Omega(A)$ is defined by $S_1 \leq S_2$ if, and only if, $S_1 \subseteq S_2$; and the logical connectives are
defined as:

$$S_1 \wedge S_2 := S_1 \cap S_2 \tag{2.8}$$

$$S_1 \vee S_2 := S_1 \cup S_2 \tag{2.9}$$

$$S_1 \Rightarrow S_2 := \{f : A \to B \mid \text{for all } g : B \to C, \text{ if } g \circ f \in S_1 \text{ then } g \circ f \in S_2\} \tag{2.10}$$



As in any Heyting algebra, the negation of an element $S$ (called the pseudo-complement
of $S$) is defined as $\neg S := S \Rightarrow 0$; so that

$$\neg S := \{f : A \to B \mid \text{for all } g : B \to C,\ g \circ f \notin S\}. \tag{2.11}$$

The main distinction between a Heyting algebra and a Boolean algebra is that, in the
former, the negation operation does not necessarily obey the law of excluded middle:
instead, all that can be said is that, for any element $S$,

$$S \vee \neg S \leq 1. \tag{2.12}$$
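The Heyting operations can be made concrete on a small example. The following sketch assumes a hypothetical three-element chain $0 \leq 1 \leq 2$ as base category (our choice, for illustration only) and implements Eqs. (2.8) to (2.11) for sieves on the bottom element. It then checks the adjunction of Eq. (2.7) exhaustively and exhibits a sieve for which the inequality in Eq. (2.12) is strict.

```python
from itertools import combinations

elements = [0, 1, 2]  # hypothetical chain 0 <= 1 <= 2

def up(p):
    """Principal sieve on p."""
    return frozenset(r for r in elements if p <= r)

def sieves(p):
    """All upward-closed subsets of up(p)."""
    cand = sorted(up(p))
    subsets = (frozenset(c) for k in range(len(cand) + 1) for c in combinations(cand, k))
    return [S for S in subsets if all(up(r) <= S for r in S)]

def implies(S1, S2, p):
    """Eq. (2.10) in a poset: r belongs if every r2 >= r that lies in S1 also lies in S2."""
    return frozenset(r for r in up(p) if all(r2 in S2 for r2 in up(r) if r2 in S1))

def neg(S, p):
    """Pseudo-complement, Eq. (2.11): S => empty sieve."""
    return implies(S, frozenset(), p)

p = 0
all_sieves = sieves(p)

# Eq. (2.7): S <= (S1 => S2) iff S meet S1 <= S2, checked for every triple of sieves.
adjunction_holds = all(
    (S <= implies(S1, S2, p)) == ((S & S1) <= S2)
    for S in all_sieves for S1 in all_sieves for S2 in all_sieves
)

S = frozenset({2})
excluded_middle = S | neg(S, p)  # strictly smaller than the unit up(0)
```

Here the sieve `{2}` has empty pseudo-complement, because every stage eventually reaches 2, so `S ∨ ¬S` is just `{2}` and not the whole of `up(0)`.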

It can be shown that the presheaf $\Omega$ is a subobject classifier for the topos $\mathrm{Set}^{\mathcal{C}}$.
That is to say, subobjects of any object $\mathbf{X}$ in this topos (i.e., any presheaf on $\mathcal{C}$) are in
one-to-one correspondence with arrows $\chi : \mathbf{X} \to \Omega$. This works as follows. First, let
$\mathbf{K}$ be a subobject of $\mathbf{X}$. Then there is an associated characteristic arrow $\chi^K : \mathbf{X} \to \Omega$,
whose 'component' $\chi^K_A : \mathbf{X}(A) \to \Omega(A)$ at each 'stage of truth' $A$ in $\mathcal{C}$ is defined as

$$\chi^K_A(x) := \{f : A \to B \mid \mathbf{X}(f)(x) \in \mathbf{K}(B)\} \tag{2.13}$$

for all $x \in \mathbf{X}(A)$. That the right hand side of Eq. (2.13) actually is a sieve on $A$ follows
from the defining properties of a subobject.

Thus, in each 'branch' of the category $\mathcal{C}$ going 'upstream' from the stage $A$, $\chi^K_A(x)$
picks out the first member $B$ in that branch for which $\mathbf{X}(f)(x)$ lies in the subset $\mathbf{K}(B)$,
and the commutative diagram Eq. (2.3) then guarantees that $\mathbf{X}(h \circ f)(x)$ will lie in $\mathbf{K}(C)$
for all $h : B \to C$. Thus each 'stage of truth' $A$ in $\mathcal{C}$ serves as a possible context for an
assignment to each $x \in \mathbf{X}(A)$ of a generalised truth-value: which is a sieve, belonging
to the Heyting algebra $\Omega(A)$, rather than an element of the Boolean algebra $\{0, 1\}$ of
normal set theory. This is the sense in which contextual, generalised truth-values arise
naturally in a topos of presheaves.

There is a converse to Eq. (2.13): namely, each arrow $\chi : \mathbf{X} \to \Omega$ (i.e., a natural
transformation between the presheaves $\mathbf{X}$ and $\Omega$) defines a subobject $\mathbf{K}^{\chi}$ of $\mathbf{X}$ via

$$\mathbf{K}^{\chi}(A) := \chi_A^{-1}\{1_{\Omega(A)}\} \tag{2.14}$$

at each stage of truth $A$.
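The correspondence between subobjects and characteristic arrows can be traced through on a toy presheaf. The sketch below is an illustration under stated assumptions: the base category is a hypothetical chain of stages 0, 1, 2; the presheaf assigns the set {0, ..., 15} to every stage; and the arrow from stage `t` to stage `t2` acts by halving `t2 - t` times. None of these choices come from the text. The code computes the sieve-valued characteristic map of Eq. (2.13) and recovers the subobject via Eq. (2.14).

```python
stages = [0, 1, 2]                      # hypothetical chain of stages 0 <= 1 <= 2
X = {t: set(range(16)) for t in stages}

def X_map(t, t2, x):
    """X applied to the unique arrow t -> t2 (t <= t2): integer-halve x, (t2 - t) times."""
    return x >> (t2 - t)

# A subobject K: K(t) is a subset of X(t), and X_map carries K(t) into K(t2).
K = {0: {0}, 1: {0, 1}, 2: {0, 1}}

def up(t):
    """Principal sieve on stage t."""
    return frozenset(t2 for t2 in stages if t <= t2)

def chi(t, x):
    """Eq. (2.13): the later stages at which the image of x has entered K."""
    return frozenset(t2 for t2 in up(t) if X_map(t, t2, x) in K[t2])

# Eq. (2.14): the subobject is the preimage of the unit (principal) sieve at each stage.
recovered = {t: {x for x in X[t] if chi(t, x) == up(t)} for t in stages}

# Naturality of chi: pushing the sieve forward agrees with evaluating chi at the later stage.
natural = all(
    chi(t, x) & up(t2) == chi(t2, X_map(t, t2, x))
    for t in stages for t2 in up(t) for x in X[t]
)

truth_values_at_0 = {x: sorted(chi(0, x)) for x in (0, 3, 5, 8)}
```

At stage 0 the four sample elements receive the four distinct sieves on that stage, ranging from 'true at all stages' to 'never true'.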

2.3.2 Global Sections of a Presheaf

For the category of presheaves on $\mathcal{C}$, a terminal
object $\mathbf{1} : \mathcal{C} \to \mathrm{Set}$ can be defined by $\mathbf{1}(A) := \{*\}$ at all stages $A$ in $\mathcal{C}$; if $f : A \to B$ is
an arrow in $\mathcal{C}$ then $\mathbf{1}(f) : \{*\} \to \{*\}$ is defined to be the map $* \mapsto *$. This is indeed a
terminal object since, for any presheaf $\mathbf{X}$, we can define a unique natural transformation
$N : \mathbf{X} \to \mathbf{1}$ whose components $N_A : \mathbf{X}(A) \to \mathbf{1}(A) = \{*\}$ are the constant maps $x \mapsto *$
for all $x \in \mathbf{X}(A)$.

A global element (or point) of a presheaf $\mathbf{X}$ is also called a global section. As an
arrow $\gamma : \mathbf{1} \to \mathbf{X}$ in the topos $\mathrm{Set}^{\mathcal{C}}$, a global section corresponds to a choice of an
element $\gamma_A \in \mathbf{X}(A)$ for each stage of truth $A$ in $\mathcal{C}$, such that, if $f : A \to B$, the
'matching condition'

$$\mathbf{X}(f)(\gamma_A) = \gamma_B \tag{2.15}$$

is satisfied. As we shall see, the Kochen-Specker theorem can be read as asserting the
non-existence of any global sections of certain presheaves that arise naturally in any
quantum theory.
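That a presheaf can have non-empty sets at every stage and still no global section is easy to see in a finite toy case. The sketch below is a brute-force search under hypothetical assumptions: the base poset has two lower stages `a1`, `a2` and one common upper stage `b`, and the two presheaves are invented for illustration. It is not a model of the Kochen-Specker obstruction itself, only of the matching condition Eq. (2.15).

```python
from itertools import product

def global_sections(sets, maps):
    """Return every choice gamma of one element per stage satisfying X(f)(gamma_A) == gamma_B.

    sets: stage -> list of elements; maps: (A, B) -> dict implementing X(f) for f : A -> B.
    """
    stages = sorted(sets)
    found = []
    for choice in product(*(sets[s] for s in stages)):
        gamma = dict(zip(stages, choice))
        if all(m[gamma[a]] == gamma[b] for (a, b), m in maps.items()):
            found.append(gamma)
    return found

# Presheaf 1: the same two-element set everywhere, with identity maps.
constant_sets = {"a1": [0, 1], "a2": [0, 1], "b": [0, 1]}
constant_maps = {("a1", "b"): {0: 0, 1: 1}, ("a2", "b"): {0: 0, 1: 1}}

# Presheaf 2: the two lower stages force incompatible values at the upper stage b.
twisted_sets = {"a1": [0], "a2": [0], "b": [0, 1]}
twisted_maps = {("a1", "b"): {0: 0}, ("a2", "b"): {0: 1}}

constant_sections = global_sections(constant_sets, constant_maps)
twisted_sections = global_sections(twisted_sets, twisted_maps)
```

The second presheaf has local elements at every stage, but the matching condition cannot be met at `b` from both sides at once, so the list of global sections is empty.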



3 Some Presheaves in Quantum Theory and Quantum Gravity

Having developed in Section 2 the idea of a topos, especially the idea of a topos of 
presheaves, we wish now to suggest some possible applications in quantum physics. 

There are several natural orders in which to present these examples. For instance,
one could follow Section 1's order of first treating quantum theory, without regard to
space, time or gravity; and then treating these latter. But we will in fact proceed
by first giving several examples involving space, time or spacetime, since: (i) in these
examples, it is especially natural to think of the objects of the presheaf's base-category
$\mathcal{C}$ as 'contexts' or 'stages' relative to which generalized truth-values are assigned; and
(ii) these examples will serve as prototypes, in various ways, for later examples.

3.1 Global reference frames in elementary wave mechanics

Throughout classical and quantum physics, we are often concerned with reference frames
(or coordinate systems), the transformations between them, and the corresponding
transformations on states of a physical system, and on physical quantities. Our first
example will present in terms of presheaves some familiar material about reference frames
in the context of non-relativistic wave mechanics.

Define the category of contexts $\mathcal{C}$ to have as its objects global Cartesian reference
frames $e := \{e^1, e^2, e^3\}$ (where $e^i$, $i = 1, 2, 3$, are vectors in Euclidean 3-space $E^3$ such
that $e^i \cdot e^j = \delta^{ij}$), all sharing a common origin; and define $\mathcal{C}$ to have as its arrows the
orthogonal transformations $O(e, e')$ from one reference frame $\{e^i\}$ to another $\{e'^i\}$, i.e.,
with a matrix representation $e'^i = \sum_{j=1}^{3} e^j\, O(e, e')^{i}{}_{j}$ (so that between any two objects,
there is a unique arrow). Define a presheaf $\mathbf{H}$ as assigning to each object $e$ in $\mathcal{C}$, a
copy $\mathbf{H}(e)$ of the Hilbert space $L^2(\mathbb{R}^3)$; and to each arrow $O(e, e')$, the unitary map
$U(e, e') : \mathbf{H}(e) \to \mathbf{H}(e')$ defined by $(U(e, e')\psi)(x) := \psi(O(e, e')^{-1}(x))$ (so that $U(e, e')$
represents the action of $O(e, e')$ as a map from one copy, $\mathbf{H}(e)$, of the (pure) state-space
$L^2(\mathbb{R}^3)$, to the other copy $\mathbf{H}(e')$). Any given $\psi \in L^2(\mathbb{R}^3)$, together with its transforms
under the various unitary maps $U(e, e')$, defines a global section of $\mathbf{H}$.
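The functorial behaviour of the maps $U(e, e')$ can be checked numerically at a single point. The following is a rough sketch only: the two rotations, the test wave-function `psi` and the evaluation point are hypothetical choices, and functions are evaluated pointwise rather than treated as elements of $L^2(\mathbb{R}^3)$. It verifies that acting with two rotations in succession agrees with acting with their composite.

```python
import math

def rot_z(theta):
    """Rotation matrix about the z-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, -s, 0.0), (s, c, 0.0), (0.0, 0.0, 1.0))

def rot_x(theta):
    """Rotation matrix about the x-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return ((1.0, 0.0, 0.0), (0.0, c, -s), (0.0, s, c))

def matmul(P, Q):
    return tuple(tuple(sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3))
                 for i in range(3))

def transpose(O):
    return tuple(tuple(O[j][i] for j in range(3)) for i in range(3))

def apply(O, x):
    return tuple(sum(O[i][j] * x[j] for j in range(3)) for i in range(3))

def U(O):
    """The map psi -> (x -> psi(O^{-1} x)); for an orthogonal O the inverse is the transpose."""
    O_inv = transpose(O)
    return lambda psi: (lambda x: psi(apply(O_inv, x)))

def psi(x):
    """A hypothetical, non-rotation-invariant wave-function."""
    return x[0] * math.exp(-(x[0] ** 2 + x[1] ** 2 + x[2] ** 2))

O1, O2 = rot_z(0.4), rot_x(1.1)
x = (0.3, -0.2, 0.5)

step_by_step = U(O2)(U(O1)(psi))(x)       # transform by O1, then by O2
composite = U(matmul(O2, O1))(psi)(x)     # transform once by the composite rotation
```

The two values agree up to rounding, which is the pointwise content of the statement that the assignment of $U$ to each arrow respects composition.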

Of course, discussions of the transformation of the wave-function under spatial
rotations etc. normally identify the different copies of the state-space $L^2(\mathbb{R}^3)$; and from
the viewpoint of those discussions, the above definition of $\mathbf{H}$ may seem at first sight
to make a mountain out of a molehill, particularly since the category of contexts in
this example is so trivial (for example, the internal logic is just the standard 'true-false'
logic). But it is a helpful prototype to have in mind when we come to more complex or
subtle examples.



For the moment, just note that this definition has the advantage of clearly
distinguishing the quantum state at the given time from its representing vectors $\psi$ in various
reference frames. Or rather, to be precise, we need to allow for the fact that the quantum
state is a yet more abstract notion, also occurring in other representations than wave
mechanics (position-representation). So the point is: this definition of $\mathbf{H}$ distinguishes
the Schrödinger-picture, wave-mechanical representative of the quantum state at the
given time (which it takes as a global section of $\mathbf{H}$) from its representing vectors $\psi$
(elements of the global section at the various 'stages' $e$).

3.2 Observers in quantum cosmology

One could argue that the example above
illustrates a contextual aspect of standard quantum theory whereby the concrete
representation of an abstract state depends on the observer; at least, this is so if we
identify reference frames with observers. As we have explained, this contextual aspect is
not emphasised in standard quantum theory since the different Hilbert spaces
associated with different observers are all naturally isomorphic (via the unitary operators
$U(e, e') : \mathbf{H}(e) \to \mathbf{H}(e')$). From a physical perspective, the fact that different observers,
related by a translation or a rotation of reference frame, see 'equivalent' physics is a
reflection of the homogeneity and isotropy of physical space.

However, the situation might well be different in cosmological situations, since the
existence of phenomena like event and particle horizons means that the physics
perceptible from the perspective of one observer may be genuinely different from that seen by
another. This suggests that any theory of quantum cosmology (or even quantum field
theory in a fixed cosmological background) may require the use of more than one Hilbert
space, in a way that cannot be 'reduced' to a single space.

Of course, it is well known that quantum field theory on a curved spacetime often 
requires more than one Hilbert space, associated with the unavoidable occurrence of 
inequivalent representations of the canonical commutation relations: this is one of the 
reasons for preferring a C*-algebra approach. But what we have in mind is different — for 
example, our scheme could easily be adapted to involve a presheaf of C*-algebras, each 
associated with an 'observer'. 

Evidently, a key question in this context is what is meant by an 'observer'; or, more 
precisely, how this idea should be represented mathematically in the formalism. One 
natural choice might be a time-like curve (in the case of quantum field theory in a 
curved background with horizons), although this does suggest that a 'history' approach 
to quantum theory would be more appropriate than any of the standard ones. Of course, 
in the case of quantum cosmology proper, these issues become far more complex since — 
for example — even what is meant by a 'time-like curve' presumably becomes the subject 
of quantum fluctuations! 

3.3 Unitary time evolution in elementary quantum theory

The discussion
above of different spatial reference frames has a precise temporal analogue. Thus, fix
once for all a global Cartesian reference frame in $E^3$, and define the base-category of
contexts $\mathcal{C}$ to be the real line $\mathbb{R}$, representing time. That is to say, let the objects of $\mathcal{C}$
be instants $t \in \mathbb{R}$; and let there be a $\mathcal{C}$-arrow from $t$ to $t'$, $f : t \to t'$, if and only if
$t \leq t'$; so there is at most one arrow between any pair of objects $t, t'$ in $\mathcal{C}$. Define the
presheaf, called $\mathbf{H}$ (as in Paragraph 3.1), as assigning to each $t$ a copy of the system's
Hilbert space $\mathcal{H}$ ($\mathcal{H}$ need not be $L^2(\mathbb{R}^3)$; here we generalize from wave mechanics).
Writing this copy as $\mathcal{H}_t$, we have $\mathbf{H}(t) := \mathcal{H}_t$. The action of $\mathbf{H}$ on $\mathcal{C}$-arrows is defined
by the Hamiltonian $\hat{H}$, via its one-parameter family of unitary exponentiations $U_t$. If
$f : t \to t'$, then $\mathbf{H}(f) : \mathcal{H}_t \to \mathcal{H}_{t'}$ is defined by $U_{t'-t}$. The action of $U_{t'-t}$ then represents
the Schrödinger-picture evolution of the system from time $t$ to $t'$; and a total history
of the system (as described in the given spatial coordinate system) is represented by a
global section of the presheaf $\mathbf{H}$ (see the footnote on Heisenberg-picture evolution). Note
that, as in the example of 3.1, the internal logic of this example is essentially trivial.
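A minimal numerical sketch of this temporal presheaf is given below. It assumes a hypothetical two-level system with a diagonal Hamiltonian, units with $\hbar = 1$, and the convention $U_t = \exp(-i\hat{H}t)$; the energies, the initial state and all function names are illustrative choices of ours. It checks that the maps attached to arrows compose correctly and that a Schrödinger-picture history satisfies the matching condition of a global section.

```python
import cmath

ENERGIES = (0.5, 1.5)  # hypothetical eigenvalues of a diagonal Hamiltonian (hbar = 1)

def U(dt, psi):
    """Apply U_dt = exp(-i H dt) to a state written in the energy eigenbasis."""
    return tuple(cmath.exp(-1j * E * dt) * c for E, c in zip(ENERGIES, psi))

def presheaf_map(t, t2, psi):
    """H(f) for the unique arrow f : t -> t2, which exists only when t <= t2."""
    if t > t2:
        raise ValueError("no arrow from a later time to an earlier one")
    return U(t2 - t, psi)

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for x, y in zip(a, b))

psi0 = (0.6, 0.8j)  # hypothetical state at time 0

def section(t):
    """A candidate global section: the state of the history at time t."""
    return U(t, psi0)

t, t2, t3 = -1.0, 0.7, 2.5

# Functoriality: evolving t -> t2 -> t3 equals evolving t -> t3 directly.
functorial = close(presheaf_map(t2, t3, presheaf_map(t, t2, psi0)),
                   presheaf_map(t, t3, psi0))

# Matching condition of Eq. (2.15): H(f)(gamma_t) == gamma_t2 for f : t -> t2.
matches = close(presheaf_map(t, t2, section(t)), section(t2))
```

The global section here is determined by its value at any one time, which reflects the remark that the logical structure of this example is essentially trivial.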

We note that a parallel discussion could be given for time evolution in classical
physics: we would attach a copy of the phase-space $\Gamma$ to each $t$, and a total history of
the system (as described in the given spatial coordinate system) would be represented
by a global section of the corresponding presheaf. It transpires that the development
of such a 'history' approach to classical physics provides a very illuminating perspective
on the mathematical structures used in the consistent-histories approach to quantum
theory; for more information see [?].



This completes the presentation, in terms of presheaves, of familiar material from
orthodox/established theories. From now on, we will present in terms of presheaves some
ideas that are currently being pursued in research on foundations of quantum theory
and quantum gravity.

3.4 Presheaves on causal sets

The previous example admits an immediate
generalisation to the theory of causal sets. By a causal set we mean a partially-ordered set
$P$ whose elements represent spacetime points in a discrete, non-continuum model, and
in which $p \leq q$, with $p, q \in P$, means that $q$ lies in the causal future of $p$.

The set $P$ is a natural base category for a presheaf of Hilbert spaces in which the
Hilbert space at a point $p \in P$ represents the quantum degrees of freedom that are
'localised' at that point/context. From another point of view, the Hilbert space at a
point $p$ could represent the history of the system (thought of now in a cosmological sense)
as viewed from the perspective of an observer localised at that point. For a discussion
of this idea see [?]. The sieve, and hence logical, structure in this example is distinctly
non-trivial.

There are several variants on this theme: for example, one may decide that the 
category of contexts should have as its objects 'regions' rather than spacetime points 
(cf. Section 1.4.1 in this regard). 



[Footnote to Section 3.3] We remark that we could similarly express in terms of presheaves
Heisenberg-picture evolution: we would instead define a presheaf that assigned to each
$\mathcal{C}$-object $t$ a copy of the set $B(\mathcal{H})$ of bounded self-adjoint operators on $\mathcal{H}$ (or say, a
copy of some other fixed set taken as the algebra of observables), and then have the maps
$U_t$ induce Heisenberg-picture evolution on the elements of the copies of $B(\mathcal{H})$.



3.5 Presheaves on spatial slices; for topological QFT

Topological quantum field
theory (TQFT) has a very well-known formulation in terms of category theory, and
it is rather straightforward to see that this extends naturally to give a certain topos
perspective.

Recall that in differential topology, two closed $n$-dimensional manifolds $\Sigma_1$ and $\Sigma_2$ are
said to be cobordant if there is a compact $(n+1)$-manifold, $M$ say, whose boundary $\partial M$ is
the disjoint union of $\Sigma_1$ and $\Sigma_2$. In TQFT, the $n$-dimensional manifolds are interpreted
as possible models for physical space (so that spacetime has dimension $n + 1$), and an
interpolating $(n+1)$-manifold is thought of as describing a form of 'topology change' in
the context of a (euclideanised) type of quantum gravity theory. In the famous Atiyah
axioms for TQFT, a Hilbert space $\mathcal{H}_{\Sigma}$ is attached to each spatial $n$-manifold $\Sigma$, and to
each cobordism from $\Sigma_1$ to $\Sigma_2$ there is associated a unitary map from $\mathcal{H}_{\Sigma_1}$ to $\mathcal{H}_{\Sigma_2}$.

We note that the collection of all compact $n$-dimensional manifolds can be regarded
as the set of objects in a category $\mathcal{C}$, in which the arrows from an object $\Sigma_1$ to another
$\Sigma_2$ are given by cobordisms from $\Sigma_1$ to $\Sigma_2$. From this perspective, the Atiyah axioms for
TQFT can be viewed as a statement of the existence of a functor from $\mathcal{C}$ to the category
of Hilbert spaces; indeed, this is how these axioms are usually stated. However, from
the perspective being developed in the present paper, we see that we can also think of
$\mathcal{C}$ as a 'category of contexts', in which case we have a natural presheaf reformulation of
TQFT.

3.6 Consistent histories formalism for quantum theory and continuous time 

In the 'History Projection Operator' (HPO) version of the consistent-histories approach
to quantum theory, propositions about the history of the system at a finite set of
time points $(t_1, t_2, \ldots, t_n)$ are represented by projection operators on the tensor product
$\mathcal{H}_{t_1} \otimes \mathcal{H}_{t_2} \otimes \cdots \otimes \mathcal{H}_{t_n}$ of $n$ copies of the Hilbert space $\mathcal{H}$ associated with the system by
standard quantum theory. The choice of this particular Hilbert space can be motivated
in several different ways. For example, the original motivation [?] was a desire to find a
concrete representation of the temporal logic of such history propositions. From another
perspective, this Hilbert space can be seen as the carrier of an irreducible
representation of the 'history group' whose Lie algebra is (on the simplifying assumption that the
system is a non-relativistic point particle moving in one dimension)

$$[x_{t_i}, x_{t_j}] = 0 \tag{3.1}$$

$$[p_{t_i}, p_{t_j}] = 0 \tag{3.2}$$

$$[x_{t_i}, p_{t_j}] = i\hbar\,\delta_{ij} \tag{3.3}$$

where $i, j = 1, 2, \ldots, n$, and $x_{t_i}$ (resp. $p_{t_i}$) is the Schrödinger-picture operator whose
spectral projectors represent propositions about the position (resp. momentum) of the
system at the time $t_i$.

One advantage of the approach based on equations (3.1)-(3.3) is that it suggests an
immediate generalisation to the case of continuous-time histories: namely, the use of the
history algebra

$$[x_t, x_{t'}] = 0 \tag{3.4}$$

$$[p_t, p_{t'}] = 0 \tag{3.5}$$

$$[x_t, p_{t'}] = i\hbar\,\tau\,\delta(t' - t) \tag{3.6}$$

where $\tau$ is a constant with the dimensions of time.

This continuous-time history algebra has been studied by a variety of authors but
here we will concentrate on Savvidou's observation [?] that the notion of 'time' appears
in two ways that differ in certain significant respects. The main idea is to introduce a
new time coordinate $s \in \mathbb{R}$, and to associate with it a Heisenberg picture defined from
the time-averaged Hamiltonian $H = \int dt\, H_t$. Thus, in particular, one defines for the
time-indexed position operator $x_t$

$$x_t(s) := \exp(isH/\hbar)\, x_t \exp(-isH/\hbar) \tag{3.7}$$

This new time $s$ is not a difference in values of $t$. Rather, if one thinks of assigning
a copy $\mathcal{H}_t$ of the system's (usual) Hilbert space $\mathcal{H}$ to each time $t$, then $s$ parametrizes
a Heisenberg-picture motion of quantities within $\mathcal{H}_t$. Accordingly, $t$ is called 'external
time', and $s$ is called 'internal time'.

This formalism has been developed in various ways: in particular, there is a natural,
dynamics-independent 'Liouville' operator that generates translations in the external
time parameter. From our topos-theoretic perspective, we note that external time is
more singular than internal time, as hinted by the delta-functions in $t$ that occur in
the history algebra's canonical commutation relations. This suggests modelling external
time, not by the usual real numbers $\mathbb{R}$, but by the reals 'enriched' with infinitesimals
in the sense of synthetic differential geometry, and which are related in some way to
the action of the Liouville operator. As emphasised earlier, this requires a non-standard
model of the real line: in fact, we have to use a real number object in a topos.

Note that this use of a topos is quite different from, and in addition to, any
development of a consistent-histories analogue of the temporal presheaf introduced in Section
3.3. In the latter case, the presheaf structure in the consistent-histories theory can
arguably be related to ideas of state reduction of the kind discussed by von Neumann and
Lüders [12].



3.7 Presheaves of Propositions, and Valuations in Quantum Theory

Finally,
we want to briefly present an application of presheaves that we have developed in detail
elsewhere [?, ?, ?]. Namely: the proposal mentioned at the end of Section 1.1, to retain a
'realist flavour' in the assignment of values to quantum-theoretic quantities, despite
no-go theorems like the Kochen-Specker theorem, by using the non-Boolean logical structure
of a topos of presheaves.

Before stating the proposal precisely, let us motivate it in terms of the assumptions (i)
and (ii) of Section 1.1. As discussed there, in quantum theory, assumption (i), i.e., that
all quantities have real-number values, fails by virtue of the Kochen-Specker theorem;
and assumption (ii), that one can measure any quantity ideally, is very problematic,
involving as it does the notion of measurement. Standard quantum theory, with its
'eigenvalue-eigenstate link' (that in state $\psi$ there is a value only for a quantity of which
$\psi$ is an eigenstate, viz. the eigenvalue) retains assumption (ii) only in the very limited
sense that if the quantity $A$ has a value, $r$ say, according to the theory, i.e., the (pure)
state $\psi$ is an eigenvector of $\hat{A}$ for eigenvalue $r$, then an ideal measurement of $A$ would
have result $r$. But setting aside this very special case, the theory faces the notorious
'measurement problem': the scarcity of values in the microrealm, due to the
eigenvalue-eigenstate link, threatens to make the macrorealm indefinite ('Schrödinger's cat').

This is of course not the place to review the programmes for solving this problem. 
But it is worth distinguishing two broad approaches to it, which we will call 'Literalism' 
and 'Extra Values'. For our topos-theoretic proposal will combine aspects of these 
approaches. They are: 

1. Literalism. This approach aims to avoid the instrumentalism of standard quantum 
theory, and yet retain its scarcity of values (the eigenvalue-eigenstate link), while 
solving the measurement problem: not by postulating a non-unitary dynamics, 
but by a distinctively interpretative strategy. So far as we know, there are two 
main forms of this approach: Everettian views (where the eigenvalue-eigenstate 
link is maintained 'within a branch'); and those based on quantum logic. 

2. Extra Values. This approach gives up the eigenvalue-eigenstate link; but retains 
standard quantum theory's unitary dynamics for the quantum state. It postulates 
extra values (and equations for their time-evolution) for some quantities. The 
quantities getting these extra values are selected either a priori, as in the pilot-wave
programme, or by the quantum state itself, as in (most) modal interpretations. 

Our topos-theoretic proposal combines aspects of Literalism and Extra Values. Like 
both these approaches, the proposal is 'realist', not instrumentalist; (though it also 
shares with standard quantum theory, at least in its Bohrian or 'Copenhagen' version, 
an emphasis on contextuality). Like Extra Values (but unlike Literalism), it attributes 
values to quantities beyond those ascribed by the eigenvalue-eigenstate link. Like
Literalism (but unlike Extra Values), these additional values are naturally defined by the
orthodox quantum formalism. More specifically: all quantities get additional values (so 
no quantity is somehow 'selected' to get such values); any quantum state defines such a 
valuation, and any such valuation obeys an appropriate version of the FUNC constraint 
mentioned in Section 1.1. The 'trick', whereby such valuations avoid no-go theorems like 
the Kochen-Specker theorem, is that the truth value ascribed to a proposition about the 
value of a physical quantity is not just 'true' or 'false'! 

Thus consider the proposition "$A \in \Delta$", saying that the value of the quantity $A$ lies
in a Borel set $\Delta \subseteq \mathbb{R}$. Roughly speaking, any such proposition is ascribed as a
truth-value a set of coarse-grainings, $f(\hat{A})$, of the operator $\hat{A}$ that represents $A$. Exactly which
coarse-grainings are in the truth-value depends in a precise and natural way on $\Delta$ and the
quantum state $\psi$: in short, $f(\hat{A})$ is in the truth-value iff $\psi$ is in the range of the spectral
projector $\hat{E}[f(A) \in f(\Delta)]$. Note the contrast with the eigenstate-eigenvalue link: our
requirement is not that $\psi$ be in the range of $\hat{E}[A \in \Delta]$, but a weaker requirement. For
$\hat{E}[f(A) \in f(\Delta)]$ is a larger spectral projector; i.e., in the lattice $\mathcal{L}(\mathcal{H})$ of projectors on
the Hilbert space $\mathcal{H}$, $\hat{E}[A \in \Delta] \leq \hat{E}[f(A) \in f(\Delta)]$.

[Footnote on the Literalism and Extra Values approaches] We give a more detailed discussion
of these approaches and two others, especially with a view to quantum gravity, in Section 2.1
of [?].

So the intuitive idea is that the new proposed truth-value of "$A \in \Delta$" is given by
the set of weaker propositions "$f(A) \in f(\Delta)$" that are true in the old (i.e.,
eigenstate-eigenvalue link) sense. To put it a bit more exactly: the new proposed truth-value of
"$A \in \Delta$" is given by the set of quantities $f(A)$ for which the corresponding weaker
proposition "$f(A) \in f(\Delta)$" is true in the old (i.e., eigenstate-eigenvalue link) sense. To
put it less exactly, but more memorably: the new truth-value of a proposition is given
by the set of its consequences that are true in the old sense.
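The idea can be illustrated in a finite-dimensional toy case. The sketch below assumes a hypothetical operator that is diagonal in a fixed basis, a state written in that basis, and a small hand-picked family of coarse-graining functions; all of these are our own illustrative choices, and the finite family only samples the full truth-value. A spectral projector is represented by the set of basis indices it projects onto, and a state lies in its range when the support of the state is contained in that index set.

```python
spectrum = [1, 2, 3, 4]        # eigenvalues of a hypothetical diagonal operator A
psi = [0.6, 0.8, 0.0, 0.0]     # state in the eigenbasis of A (a superposition, not an eigenstate)
delta = {1}                    # the set Delta in the proposition "A in Delta"

# A hand-picked, finite family of coarse-graining functions f (names are hypothetical).
coarse_grainings = {
    "identity": lambda a: a,
    "parity": lambda a: a % 2,
    "is_large": lambda a: int(a > 2),
    "constant": lambda a: 0,
}

def projector(f, delta):
    """Index set spanning the range of E[f(A) in f(Delta)] for the diagonal operator A."""
    image = {f(a) for a in spectrum if a in delta}
    return {i for i, a in enumerate(spectrum) if f(a) in image}

support = {i for i, c in enumerate(psi) if c != 0}

# Old sense: "A in Delta" is true only if psi lies in the range of E[A in Delta].
true_in_old_sense = support <= projector(coarse_grainings["identity"], delta)

# New sense: the coarse-grainings whose weakened proposition is true in the old sense.
truth_value = {name for name, f in coarse_grainings.items()
               if support <= projector(f, delta)}

# Coarse-graining never shrinks the projector: E[A in Delta] <= E[f(A) in f(Delta)].
monotone = all(projector(coarse_grainings["identity"], delta) <= projector(f, delta)
               for f in coarse_grainings.values())
```

In this example the proposition is false in the old sense, yet its generalised truth-value is non-empty: the state does definitely have a 'small' value of the quantity, even though it has no definite value of the quantity itself.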

We turn to stating the proposal exactly. We first introduce the set $\mathcal{O}$ of all bounded
self-adjoint operators $\hat{A}, \hat{B}, \ldots$ on the Hilbert space $\mathcal{H}$ of a quantum system. We turn $\mathcal{O}$
into a category by defining the objects to be the elements of $\mathcal{O}$, and saying that there is an
arrow from $\hat{A}$ to $\hat{B}$ if there exists a real-valued function $f$ on $\sigma(\hat{A}) \subset \mathbb{R}$, the spectrum of
$\hat{A}$, such that $\hat{B} = f(\hat{A})$ (with the usual definition of a function of a self-adjoint operator,
using the spectral representation). If $\hat{B} = f(\hat{A})$, for some $f : \sigma(\hat{A}) \to \mathbb{R}$, then the
corresponding arrow in the category $\mathcal{O}$ will be denoted $f_{\mathcal{O}} : \hat{A} \to \hat{B}$.

We next define two presheaves on the category $\mathcal{O}$, called the dual presheaf and
the coarse-graining presheaf respectively. The former affords an elegant formulation of
the Kochen-Specker theorem, namely as a statement that the dual presheaf does not
have global sections. The latter is at the basis of our proposed generalised truth-value
assignments.

The dual presheaf on $\mathcal{O}$ is the covariant functor $\mathbf{D} : \mathcal{O} \to \mathrm{Set}$ defined as follows:

1. On objects: $\mathbf{D}(\hat{A})$ is the dual of $W_A$, where $W_A$ is the spectral algebra of the
operator $\hat{A}$; i.e. $W_A$ is the collection of all projectors onto the subspaces of $\mathcal{H}$
associated with Borel subsets of $\sigma(\hat{A})$. That is to say: $\mathbf{D}(\hat{A})$ is defined to be the
set $\mathrm{Hom}(W_A, \{0, 1\})$ of all homomorphisms from the Boolean algebra $W_A$ to the
Boolean algebra $\{0, 1\}$.

2. On arrows: If $f_{\mathcal{O}} : \hat{A} \to \hat{B}$, so that $\hat{B} = f(\hat{A})$, then $\mathbf{D}(f_{\mathcal{O}}) : D(W_A) \to D(W_B)$ is
defined by $\mathbf{D}(f_{\mathcal{O}})(\chi) := \chi|_{W_{f(A)}}$ where $\chi|_{W_{f(A)}}$ denotes the restriction of $\chi \in D(W_A)$
to the subalgebra $W_{f(A)} \subseteq W_A$.

A global element (global section) of the functor $\mathbf{D} : \mathcal{O} \to \mathrm{Set}$ is then a function $\gamma$
that associates to each $\hat{A} \in \mathcal{O}$ an element $\gamma_A$ of the dual of $W_A$ such that if $f_{\mathcal{O}} : \hat{A} \to \hat{B}$
(so $\hat{B} = f(\hat{A})$ and $W_B \subseteq W_A$), then $\gamma_A|_{W_B} = \gamma_B$. Thus, for all projectors $\hat{\alpha} \in W_B \subseteq W_A$,

$$\gamma_B(\hat{\alpha}) = \gamma_A(\hat{\alpha}). \tag{3.8}$$

Since each $\hat{\alpha}$ in the lattice $\mathcal{L}(\mathcal{H})$ of projection operators on $\mathcal{H}$ belongs to at least one
such spectral algebra $W_A$ (for example, the algebra $\{\hat{0}, \hat{1}, \hat{\alpha}, \hat{1} - \hat{\alpha}\}$) it follows from Eq.
(3.8) that a global section of $\mathbf{D}$ associates to each projection operator $\hat{\alpha} \in \mathcal{L}(\mathcal{H})$ a number
$V(\hat{\alpha})$ which is either 0 or 1, and is such that, if $\hat{\alpha} \wedge \hat{\beta} = \hat{0}$, then $V(\hat{\alpha} \vee \hat{\beta}) = V(\hat{\alpha}) + V(\hat{\beta})$.
In other words, a global section $\gamma$ of the presheaf $\mathbf{D}$ would correspond to an assignment
of truth-values $\{0, 1\}$ to all propositions of the form "$A \in \Delta$", which obeyed the FUNC
condition Eq. (3.8). These are precisely the types of valuation prohibited, provided
that $\dim \mathcal{H} > 2$, by the Kochen-Specker theorem. So an alternative way of expressing
the Kochen-Specker theorem is that, if $\dim \mathcal{H} > 2$, the dual presheaf $\mathbf{D}$ has no global
sections.

However, we can use the subobject classifier $\Omega$ in the topos $\mathrm{Set}^{\mathcal{O}}$ of all presheaves
on $\mathcal{O}$ to assign generalized truth-values to the propositions "$A \in \Delta$". These
truth-values will be sieves, as defined in Section 2.3.1; and since they will be assigned relative to
each 'context' or 'stage of truth' $\hat{A}$ in $\mathcal{O}$, these truth-values will be contextual as well as
generalized. And as we said at the end of Section 2.2: because in any topos the subobject
classifier $\Omega$ is fixed by the structure of the topos, $\Omega$ is unique up to isomorphism. Thus
the family of associated truth-value assignments is fixed, and the traditional objection
to multi-valued logics (that their structure often seems arbitrary) does not apply to
these generalized, contextual truth-values.

We first define the appropriate presheaf of propositions. The coarse-graining presheaf
over $\mathcal{O}$ is the covariant functor $\mathbf{G} : \mathcal{O} \to \mathrm{Set}$ defined as follows.

1. On objects in $\mathcal{O}$: $\mathbf{G}(\hat{A}) := W_A$, where $W_A$ is the spectral algebra of $\hat{A}$.

2. On arrows in $\mathcal{O}$: If $f_{\mathcal{O}} : \hat{A} \to \hat{B}$ (i.e., $\hat{B} = f(\hat{A})$), then $\mathbf{G}(f_{\mathcal{O}}) : W_A \to W_B$ is
defined as

$$\mathbf{G}(f_{\mathcal{O}})(\hat{E}[A \in \Delta]) := \hat{E}[f(A) \in f(\Delta)] \tag{3.9}$$

(where, if $f(\Delta)$ is not Borel, the right hand side is to be understood in the sense
of Theorem 4.1 of [?], a measure-theoretic nicety that we shall not discuss here).

A sieve-valued valuation on $G$ is a function $\nu$ that assigns, to each object $\hat A$ in $\mathcal{O}$ and
each Borel set $\Delta \subseteq \sigma(\hat A)$, a sieve of arrows in $\mathcal{O}$ on $\hat A$ (i.e., a sieve of arrows with $\hat A$
as domain). We write the values of this function as $\nu(A \in \Delta)$. (One could
equally well write $\nu(\hat E[A \in \Delta])$, provided one bears in mind that the value depends not
only on the projector $\hat E[A \in \Delta]$, but also on the operator (context) $\hat A$ of whose spectral
family the projector is considered to be a member.)

From the logical point of view, a natural desideratum for any kind of valuation on
a presheaf of propositions such as $G$ is that the valuation should specify a subobject of
$G$. For in logic one often thinks of a valuation as specifying the 'selected' or 'winning'
propositions: in this case, the 'selected' elements $\hat E[A \in \Delta]$ in each $G(\hat A)$. So it is
natural to require that the elements that a valuation 'selects' at the various contexts $\hat A$
together define a subobject of $G$. But we saw in Section 2.3.1 that subobjects are in
one-one correspondence with arrows, i.e., natural transformations, $N : G \to \Omega$. So it is
natural to require a sieve-valued valuation $\nu$ to define such a natural transformation by
the equation $N^\nu_A(\hat E[A \in \Delta]) := \nu(A \in \Delta)$.

This desideratum leads directly to the analogue for presheaves of the famous functional
composition condition of the Kochen-Specker theorem, called FUNC above, which we will
again call FUNC in the setting of presheaves. For it turns out that a
sieve-valued valuation defines such a natural transformation iff it obeys (the presheaf
version of) FUNC.

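To spell this out, we first recall that the subobject classifier $\Omega$ 'pushes along' sieves,
according to Eq. (2.4). For the category $\mathcal{O}$, this becomes: if $f_{\mathcal{O}} : \hat A \to \hat B$, then
$\Omega(f_{\mathcal{O}}) : \Omega(\hat A) \to \Omega(\hat B)$ is defined by

$$\Omega(f_{\mathcal{O}})(S) := \{h_{\mathcal{O}} : \hat B \to \hat C \mid h_{\mathcal{O}} \circ f_{\mathcal{O}} \in S\} \tag{3.10}$$

for all sieves $S \in \Omega(\hat A)$.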
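
The push-forward of Eq. (3.10) can be made concrete in a small toy model. The following Python sketch assumes, purely for illustration, that $\hat A$ is a non-degenerate operator with spectrum {1, 2, 3}, and that each coarse-graining $f(\hat A)$ is identified with the partition of the spectrum into the level sets of $f$. Under this assumption there is at most one arrow between two contexts, so an arrow out of a context can be named by its codomain. All function names are hypothetical.

```python
def partitions(items):
    """Yield every partition of the list `items` as a frozenset of blocks."""
    if not items:
        yield frozenset()
        return
    first, rest = items[0], items[1:]
    for p in partitions(rest):
        yield p | {frozenset([first])}            # `first` in its own block
        for block in p:                           # or joined to an old block
            yield (p - {block}) | {block | {first}}


def make_partition(*blocks):
    """Build a context from lists of eigenvalues, one list per block."""
    return frozenset(frozenset(b) for b in blocks)


OBJECTS = list(partitions([1, 2, 3]))             # the contexts of the toy model


def has_arrow(a, b):
    """True iff b is a coarse-graining of a, i.e. b = f(a) for some f."""
    return all(any(block <= big for big in b) for block in a)


def arrows_from(a):
    """All arrows with domain a, named by codomain (the maximal sieve on a)."""
    return frozenset(b for b in OBJECTS if has_arrow(a, b))


def is_sieve(s, a):
    """s is a sieve on a if it is closed under composition with later arrows."""
    return s <= arrows_from(a) and all(arrows_from(b) <= s for b in s)


def push(s, a, b):
    """Eq. (3.10): the sieve on b of all arrows h such that h o f lies in s."""
    if not (has_arrow(a, b) and is_sieve(s, a)):
        raise ValueError("need an arrow a -> b and a sieve s on a")
    return frozenset(c for c in arrows_from(b) if c in s)


finest = make_partition([1], [2], [3])            # the context of A itself
mid = make_partition([1, 2], [3])
other = make_partition([1], [2, 3])
top = make_partition([1, 2, 3])                   # constant functions of A

S = arrows_from(other)                            # a sieve on `finest`
print(push(S, finest, mid) == frozenset({top}))
```

In this toy model the push-forward simply restricts a sieve on the finer context to the arrows that start at the coarser context. In particular the maximal sieve is pushed to the maximal sieve.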

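Accordingly, we say that a sieve-valued valuation $\nu$ on $G$ satisfies generalized functional
composition (for short, FUNC) if for all $\hat A$, $\hat B$ and $f_{\mathcal{O}} : \hat A \to \hat B$ and all
$\hat E[A \in \Delta] \in G(\hat A)$, the valuation obeys

$$\nu(B \in G(f_{\mathcal{O}})(\hat E[A \in \Delta])) = \nu(f(A) \in f(\Delta)) = \Omega(f_{\mathcal{O}})(\nu(A \in \Delta)). \tag{3.11}$$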

FUNC is exactly the condition a sieve-valued valuation must obey in order to
thus define a natural transformation, i.e., a subobject of $G$, by the natural equation
$N^\nu_A(\hat E[A \in \Delta]) := \nu(A \in \Delta)$. That is: a sieve-valued valuation $\nu$ on $G$ obeys FUNC if
and only if the functions at each 'stage of truth' $\hat A$

$$N^\nu_A(\hat E[A \in \Delta]) := \nu(A \in \Delta) \tag{3.12}$$

define a natural transformation $N^\nu$ from $G$ to $\Omega$.

It turns out that with any quantum state there is associated such a FUNC-obeying
sieve-valued valuation. Furthermore, this valuation gives the natural generalization of
the eigenvalue-eigenstate link described at the start of this Subsection. That is, a quantum
state $\psi$ induces a sieve on each $\hat A$ in $\mathcal{O}$ by the requirement that an arrow $f_{\mathcal{O}} : \hat A \to \hat B$
is in the sieve iff $\psi$ is in the range of the spectral projector $\hat E[B \in f(\Delta)]$. To be precise,
we define for any $\psi$, and any $\Delta$ a Borel subset of the spectrum $\sigma(\hat A)$ of $\hat A$:

$$\begin{aligned}
\nu^\psi(A \in \Delta) &:= \{f_{\mathcal{O}} : \hat A \to \hat B \mid \hat E[B \in f(\Delta)]\psi = \psi\} \\
&= \{f_{\mathcal{O}} : \hat A \to \hat B \mid \mathrm{Prob}(B \in f(\Delta); \psi) = 1\}
\end{aligned} \tag{3.13}$$

where $\mathrm{Prob}(B \in f(\Delta); \psi)$ is the usual Born-rule probability that the result of a
measurement of $B$ will lie in $f(\Delta)$, given the state $\psi$.

This definition generalizes the eigenstate-eigenvalue link, in the sense that we require
not that $\psi$ be in the range of $\hat E[A \in \Delta]$, but only that it be in the range of the larger
projector $\hat E[f(A) \in f(\Delta)]$. One can check that the definition satisfies FUNC, and also
has other properties that it is natural to require of a valuation (discussed in |1|, |^, |^). 
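
To illustrate Eq. (3.13) and the claim that it satisfies FUNC, the following Python sketch evaluates the valuation in a three-dimensional toy model. The assumptions are ours and not part of the construction above: $\hat A$ is non-degenerate with spectrum {1, 2, 3}, the state is given by its amplitudes in the eigenbasis of $\hat A$, and each context $f(\hat A)$ is identified with the partition of the spectrum into level sets of $f$, so that an arrow is named by its codomain. A Borel set of a coarser context is represented by the union of the corresponding blocks. All names are hypothetical.

```python
import math
from itertools import combinations


def partitions(items):
    """Yield every partition of the list `items` as a frozenset of blocks."""
    if not items:
        yield frozenset()
        return
    first, rest = items[0], items[1:]
    for p in partitions(rest):
        yield p | {frozenset([first])}
        for block in p:
            yield (p - {block}) | {block | {first}}


def make_partition(*blocks):
    return frozenset(frozenset(b) for b in blocks)


SPECTRUM = [1, 2, 3]
CONTEXTS = list(partitions(SPECTRUM))


def arrows_from(a):
    """Codomains of all arrows out of a, i.e. all coarse-grainings of a."""
    return frozenset(
        b for b in CONTEXTS
        if all(any(block <= big for big in b) for block in a)
    )


def saturate(delta, c):
    """Eigenvalues of A in blocks of c that meet delta; stands for f(Delta)."""
    return frozenset(x for block in c if block & delta for x in block)


def prob(delta, psi):
    """Born-rule probability of finding A in delta, given amplitudes psi."""
    return sum(abs(psi[a]) ** 2 for a in delta)


def valuation(b, delta, psi):
    """Eq. (3.13) at the context b, for a set delta saturated with respect to b."""
    return frozenset(
        c for c in arrows_from(b)
        if math.isclose(prob(saturate(delta, c), psi), 1.0)
    )


def obeys_func(psi):
    """Check Eq. (3.11) for every arrow b -> c and every proposition at b."""
    all_subsets = [frozenset(s) for r in range(len(SPECTRUM) + 1)
                   for s in combinations(SPECTRUM, r)]
    for b in CONTEXTS:
        for delta in all_subsets:
            if saturate(delta, b) != delta:
                continue                  # not a set of eigenvalues of b
            for c in arrows_from(b):
                pushed = valuation(b, delta, psi) & arrows_from(c)  # Eq. (3.10)
                if valuation(c, saturate(delta, c), psi) != pushed:
                    return False
    return True


finest = make_partition([1], [2], [3])
top = make_partition([1, 2, 3])
psi = {1: 1 / math.sqrt(2), 2: 1 / math.sqrt(2), 3: 0.0}

print(sorted(sorted(map(sorted, c)) for c in valuation(finest, {1}, psi)))
print(obeys_func(psi))
```

For this superposition of the first two eigenvectors, the proposition "$A \in \{1\}$" is assigned the sieve containing only the coarse-graining that merges the eigenvalues 1 and 2, together with the constant function of $\hat A$. It is therefore neither totally true (the maximal sieve) nor assigned a minimal truth-value, whereas "$A \in \{1, 2\}$" receives the maximal sieve.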

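Finally, we note that FUNC and these other properties are also enjoyed by
the exactly analogous definition of a sieve-valued valuation $\nu^\rho$ associated with a density
matrix $\rho$. One defines:

$$\begin{aligned}
\nu^\rho(A \in \Delta) &:= \{f_{\mathcal{O}} : \hat A \to \hat B \mid \mathrm{Prob}(B \in f(\Delta); \rho) = 1\} \\
&= \{f_{\mathcal{O}} : \hat A \to \hat B \mid \mathrm{tr}(\rho\,\hat E[B \in f(\Delta)]) = 1\}.
\end{aligned} \tag{3.14}$$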



4 Conclusion 

In this paper we have suggested that topos-theoretic notions, in particular the idea of 
a topos of presheaves on a base-category C of suitably chosen 'contexts', may well be 
useful both in the foundations of quantum theory, and in quantum gravity. Of course, 
much remains to be done in applying these notions to the research areas listed in Section 
3. Even for the last area listed (Section 3.7), where the application has been worked 
out in detail, there are many further natural questions to investigate: for example, one 
might consider how standard topics such as the uncertainty relations or non-locality 
appear in this framework. But we hope that such an open-ended situation — even in the 
area of logical, rather than physical, theorizing — might please someone so active and 
enthusiastic about logical and physical research as Marisa Dalla Chiara! 